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ABSTRACT 

There  is  a  growing  need  within  the  Na\ry  for  methods  of  detecting 
discrete  narrowband  signals  in  a  non-stationary  background.   This  paper 
concerns  itself  with  the  application  of  digital  processing  and  spectral 
analysis  techniques  toward  that  goal.   The  use  of  the  fast  Fourier 
Transform  in  estimating  the  power  spectrum  of  a  signal  is  described. 
The  method  involves  sectioning  the  time  record,  making  "raw"  estimates 
of  the  spectrum  from  these  sections,  and  averaging  these  "raw"  estimates. 
It  is  shown  that  more  stable  estimates  are  available  if  the  segments  are 
overlapped  and  an  optimum  amount  of  overlap  for  the  case  of  the  Manning 
Window  is  found.   It  is  shown  that  the  stability  of  these  spectral  esti- 
mates can  be  interpreted  as  processing  gain  in  the  case  of  a  discrete 
narrowband  signal  in  additive  noise.   And  finally,  a  brief  description  of 
signal  detection  theory  applied  to  a  human  observer  is  presented  to  empha- 
size the  flexibility  that  a  human  operator  can  bring  to  a  signal  detection 
system. 
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I.  INTRODUCTION 

Today's  ASW  signal  processing  systems  continue  to  grow  in  complexity 
and  while  many  functions  will  be  automated  in  systems  now  under  develop- 
ment, the  human  operator  will  continue  to  play  a  key  role  in  the  proces- 
sing of  data  and  the  making  of  decisions.  Man  provides  a  flexibility  to 
the  system  that  is  not  available  through  complete  automation  and  it  is, 
therefore,  imperative  that  he  be  considered  an  integral  part  of  the 
system. 

In  the  past,  assumptions  have  been  made  regarding  the  requirements 
that  could  be  imposed  upon  the  operator,  without  any  precise  knowledge 
of  how  an  operator's  performance  actually  would  be  influenced  by  the  sus- 
tained demands  of  a  system.   In  general,  it  apparently  has  been  assumed 
by  system  designers  that  the  operator  was  like  any  of  the  hardware  com- 
ponents that  go  into  a  system,  i.e.  that  he  would  continue  to  perform  at 
a  level  throughout  a  mission  approximately  equal  to  that  at  which  he 
began.   Common  sense  does  not  support  this  assumption,  but  to  assume 
otherwise  introduced  difficult  problems  for  which  no  useful  answers 
were  available.   It  seemed  wise,  or  at  least  expedient,  to  let  the 
characteristics  of  the  electronics  determine  the  direction  of  design  and 
the  development  of  procedures  for  the  use  of  the  system,  with  only  casual 
consideration  of  the  characteristics  of  the  operator.   However,  studies 
of  the  role  of  the  operator  indicate  that  the  rate  of  degradation  of 

operator  efficiency,  and  therefore  system  effectiveness,  is  a  function 
of  time.  And  it  is  much  more  rapid  than  even  the  most  critical  "user" 
or  the  most  knowledgeable  designer  would  have  thought  reasonable  when 


the  systems  were  much  less  complicated,  and  demands  on  the  operator  were 
much  less  drastic  than  at  present. 

The  quality  of  the  operator's  decision  is  based,  to  a  great  extent, 
on  his  insight  and  understanding  of  the  problem.   Thus,  the  primary  objec- 
tive of  a  system  which  utilizes  a  human  observer  is  to  provide  the  oper- 
ator with  the  necessary  insight  and  understanding  of  the  problem  to  enable 
him  to  make  the  best  decision  possible.   To  do  this,  the  display  should 
utilize  natural  qualities  of  human  perception  so  as  not  to  drain  the 
operator  psychologically  or  expect  him  to  make  dicisions  based  on  infor- 
mation he  is  not  able  to  fully  perceive. 

The  quality  of  the  operator's  decision  is  also  based  on  the  accuracy 
with  which  the  information  provided  him  actually  represents  the  physical 
situation.   The  purpose  of  the  passive  ASW  processor  is  to  detect  the  pre- 
sence of  certain  narrowband   signals  with  random  phase  variation  from  a 
background  of  random  ocean  noise.   It  has  been  shown  that  the  optimum 
receiver,  in  a  maximum  likelihood  sense,  for  detection  of  such  signals 
is  an  infinite  set  of  Fourier  filters  followed  by  incoherent  integration 
integration  of  power  and  a  decision  element  (Williams,  1972).   The  pre- 
sent level  of  technology  indicates  that  digital  processors  offer  the  high 
resolution,  high  dynamic  range,  repeatability,  and  flexibility  necessary 
to  effectively  produce  such  a  receiver. 

The  purpose  of  this  paper  is  to  discuss  the  digital  processing  of 
acoustic  signals  with  application  to  a  new  generation  of  passive  ASW  sign- 
al processors  being  developed  today.   Specific  mention  will  be  made  of 
techniques  in  the  "Spotlight"  processor  under  development  at  the  Naval 
Postgraduate  School  by  Professor  G.A.  Rahe. 

General  digital  processing  considerations  are  discussed.   The  statis- 
tical properties  of  the  spectral  estimate  are  presented  as  well  as  methods 


for  improving  them.   The  processing  gain  available  from  a  system,  which 
utilizes  the  FFT  as  a  filtering  device  and  incoherently  integrates  the 
power,  is  derived  and  finally  a  brief  description  of  detection  theory 
applied  to  a  human  observer  in  a  sequential  observation  task  is  discussed 


II.   DATA  ANALYSIS 

A.   NOISE  REPRESENTATION 

The  ambient  acoustic  wave  fields  in  the  ocean  are  often  quite  ener- 
getic and  contain  a  wide  range  of  frequencies.   The  study  of  these  noise 
fields  forms  an  important  and  necessary  prelude  to  any  serious  design  of 
detection  systems.   In  the  ocean,  for  example,  one  finds  biological 
noise  sources,  noises  due  to  wave  action  at  the  surface  of  the  water, 
noises  produced  by  meteorological  conditions  such  as  precipitation,  and 
a  great  deal  of  shipping  noise.   For  passive  detection  systems,  these 
noise  fields  furnish  the  background  out  of  which  signals  must  be  detected. 
From  the  experimental  work  of  Parkins  (1968)  it  seems  reasonable  to  as- 
sume that  the  ocean  noise  field  may,  at  least  a  large  percentage  of  the 
time,  be  modeled  as  a  weakly  stationary,  Gaussian  random  process. 

In  order  to  understand  the  theoretical  concepts  of  signal  processing 
it  is  necessary  to  characterize  the  ambient  noise  field  (and  the  desired 
signals)  in  commonly  understood  mathematical  terms.   The  frequency  domain 
(spectral)  representation  in  which  the  "signal"  is  decomposed  into  a  set 
of  sinusoidal  components  as  specific  frequencies  will  be  considered  here. 

Select  a  particular  sample  function  of  noise  and  select  from  that 

sample  an  interval  of  duration  L  extending,  say,  from  t  =  -L/2  to  t  =  L/2, 

(s) 
Such  noise  sample  n    (t)  is  shown  in  figure  2.1(a).   Now  generate  a 

periodic  waveform  in  which  the  waveform  in  the  selected  interval  is  re- 
peated every  L  seconds  as  in  figure  2.1(b).   This  periodic  signal 

(s) 

n    (t)  with  period  L  can  be  represented  as  a  sum  of  harmonically  re- 

lated  sinusoids  (Fourier  series)  as  follows: 


n/  ^(t)  =  A  +  y  A  cos(27rkf  t  -  (j),  )  (2.1) 

L         o   ,  ^,   n         o    ^k 
k=l 

where  A  is  a  DC  level;  A^  and  (J)   are,  respectively,  the  amplitude  and 

phase  of  each  sinusoid  (line);  and  f   (the  fundamental  frequency)  is  1/L. 

Plots  of  A^  and  ^     as  functions  of  frequency  are  referred  to  as  amplitude 

and  phase  spectra,  respectively.   A^  and  c()   are  given  by 


and 


\'=^\+\^        ;k=  1,2,3,...  (2.2) 

(j>j^  =   tan"^(b^/aj^)  ;  k  =  1,2,3,...  (2.3) 


where  L/2 


2 


f 


(s) 


n^^  '(t)  cos(2^kf^t)dt  ;  k  =  1,2,3,...       (2.4) 
-L/2 


\   L  j  '^t    ■  •     -    o 


and 


\-l 


L/2 
n^^^^(t)  sin(27Tkf^t)dt  ;  k  =  1,2,3,...       (2.5) 


-L/2 

(a^  and  b   are  Fourier  coefficients) .   The  DC  level  a  is  obtained  by 

(s) 
calculating  the  time  average  of  n     (t): 

Li 

L/2 

a^  =  A  =  f  f  n/^\t)dt  (2.6) 

o     o    L  J   L 

-L/2 
Several  examples  of  amplitude  and  phase  spectra  of  periodic  signals  are 
shown  in  figure  2.2. 

Although  not  all  periodic  signals  can  be  represented  by  Fourier  har- 
monic series,  the  class  which  can  be  is  large.   The  limitations  of  the 
representation  are  discussed  in  Thomas  and  in  Jenkins  and  Watts.   Several 
other  forms  of  Fourier  series  are  also  discussed  in  the  literature.   The 
form  of  eqn.  2.1  is  a  modified  trigonometric  version  and  is  useful  from 
an  intuitive  standpoint.   Another  form,  which  is  more  convenient  for 
mathematical  manipulations,  is  the  exponential  or  complex  version  in 


4n(^)(t) 


i-^'?(^) 


(b) 


>-  -fc 


>-  t 


Figure  2.1.  (a)  A  sanple  noise  vraveforni.  (b)  A  periodic 
waveforia  is  generated  by  repeating  the  interval  in  (a) 
from  -L/2  t©  L/2. 
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Figure   2.2.      Amplitude   and  phase    aspectra  of   seme   periodic 
signals. 
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(s) 

which  the  signal  n    (t)  is  given  by 

k=-<» 


where  L/2 

■  n/^^Oe-J^Afot^t  ;   k  =  1,2,3,... 


C.4 


L 
L/2 


(2.8) 


The  coefficients  C   are  complex  and  have  the  property  that 


(i.e.  C  and  C   are  complex  conjugates).   These  coefficients  are  related 
to  the  coefficients  in  equation  2.1  by 

C  =  A  =  a  (2.10) 

o    o    o 

A 
^k  ^  2^  ^^     1/2  (aj^  +  jbj^)  (2.11) 

The  exponential  spectrum  of  the  Fourier  series  produces  a  two-sided 
spectrum,  each  of  half  amplitude,  one  at  frequency  f  and  one  at  frequency 
-f.   Although  equation  2.7  indicates  that  the  representation  includes 
negative  frequencies,  this  is  not  of  physical  significance  because,  in 
terms  of  this  representation,  the  sum  of  the  components  at  the  positive 
and  negative  frequencies  produces  a  real  (in  a  mathematical  sense) 
quantity. 

Frequency  domain  representations  of  non-periodic  signals  require  a 
generalization  of  the  Fourier  series  approach.   One  might  intuitively 
reason  that  L  approaches  infinity  for  a  nonperiodic  signal,  which  implies 
that  the  fundamental  frequency  (1/L)  approaches  zero.   If  this  reasoning 
is  carried  out  mathematically,  it  leads  to  a  representation  of  the  sig- 
nal as  a  continuum  of  frequencies  rather  than  a  "line"  spectrum.   The 
continuous  representation  (Fourier  integral  or  Fourier  transform)  is 
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given  by 


n(t)  = 


N(f)e^^''^''  df  (2.11) 


wnere 


00 


N(f)  = 


'  n(t)e  ^^""^^  dt  (2.12) 


In  general,  N(f)  is  a  complex  quantity  that  can  be  represented  by  its 
amplitude  and  phase  components  as  follows: 

N(f)  =  |N(f)|e^*^^^  (2.12a) 

Plots  of   N(f)  and  <j>(f)  as  functions  of  frequency  are  called  ampli- 
tude and  phase  spectra,  respectively,  and  are  analogous  to  the  discrete 
A^  and  c})  .   Several  examples  of  the  amplitude  and  phase  spectra  of  non- 
periodic  signals  are  shown  in  figure  2.3.   Note  the  similarity  between 
the  continuous  spectra  and  the  envelope  of  the  line  spectra  in  figure 
2.2. 

It  should  be  noted,  however,  that  not  all  time  functions  possess  a 
Fourier  transform.   The  condition  for  the  existence  of  the  Fourier  trans- 
form is  that  the  Fourier  integral  exist  in  the  limit  as  T->°°.   Existence 
is  guaranteed  by  absolute  convergence  of  the  integral  of  equation  2.12, 


n(t)  1^  dt  <  ~  (2.12b) 


A  number  of  functions  having  great  usefulness  do  not  meet  this  re- 
quirement; functions  such  as  sine  waves  or  step  functions.   These  func- 
tions are  nevertheless  being  handled  by  allowing  the  Fourier  transform 
to  contain  impulses.   In  practice,  the  class  of  signals  which  are  Fourier 
transformable  is  sufficiently  large  to  encompass  a  wide  variety  of  ap- 
plications. 
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Figure  2.3.   Amplitude  and  phase  spectra  of  some  nonpcriodic 
signals. 
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Fourier  series  and  Fourier  transforms  of  signals  differ  in  their 
theoretical  solutions  attributable  primarily  to  the  fact  that,  for 
practical  purposes,  only  finite  length  signals  can  actually  be  sampled 
and  computed.   The  results  of  this  computation  are  used  only  to  provide 
estimates  of  the  theoretical  solutions.   As  seen  in  equation  2.7,  the 
series  method  requires  the  sum  of  an  infinite  number  of  discrete  sinu- 
soids and  the  integral  method  requires  an  infinitesimal  frequency  re- 
solution.  Therefore,  only  approximations  to  these  representations  are 
obtained  in  practice.   The  discrete  Fourier  Transform  (DFT) ,  which  will 
be  discussed  later,  approximates  a  Fourier  series  by  arbitrarily  forcing 
the  signal  to  be  periodic  and  then  truncating  it  to  a  finite  number  of 
terms. 

B.   DIGITAL  SIGNAL  PROCESSING 

During  the  decade  of  1960-1970,  an  important  new  approach  to  wave- 
form manipulation,  or  signal  processing,  came  into  importance.   It  be- 
came practical  to  represent  information-bearing  waveforms  digitally,  that 
is,  symbolically,  and  to  do  signal  processing  on  the  digital  representa- 
tion of  the  waveform.   Thus  all  the  flexibility  of  digital  computers 
became  available  for  processing  of  waveforms,  and  this  has  had  an  effect 
on  all  fields  in  which  waveform  processing  is  important.   It  became  ap- 
parent that  one  could  manipulate  waveforms  digitally  in  ways  that  would 
be  totally  impractical  with  continuous  representations.   One  advantage 
in  this  respect   is  the  freedom  from  time  constraints  when  a  waveform  is 
in  a  digital  memory.   Once  a  waveform  is  in  a  computer  memory,  it  may 
be  considered  to  have  any  desired  time  base  associated  with  it.   As  a 
result  of  the  Fast  Fourier  Transform  (FFT)  algorithm,  it  became  practical 
to  do  signal  processing  on  waveforms  in  either  the  time  or  the  frequency 
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domain,  something  never  practical  with  continuous  systems.   Ihe  Fourier 
transform  became  not  just  a  theoretical  discription,  but  a  tool;  and 
real-time  digital  spectral  analysis  became  a  reality.   Some  of  the 
practical  considerations  of  perfoirniing  this  processing  will  be  discussed 
in  this  section. 

1.   Sampling  Considerations 

Digital  signal  processing  is  based  on  the  principle  that  an  ana- 
log signal  can  be  represented  by  the  values  of  the  signal  at  discrete 
points  in  time.   This  principle  is  embodied  in  the  sampling  theorem, 
which  states  that  a  band-limited  signal  is  completely  characterized  by 

a  set  of  samples  taken  at  a  minimum  rate  of  2f     (called  the  Nyquist 

max 

rate),  where  f    represents  the  highest  frequency  component  in  the 
max 

signal.   Although  this  statement  of  the  sampling  theorem  appears  straight- 
forward, there  are  several  practical  considerations  which  should  be  dis- 
cussed. 

One  difficulty  in  applying  the  sampling  theorem  is  that  actual 
signals  do  not  occupy  well  defined  frequency  bands,  i.e.,  the  spectra 
of  real  signals  tend  to  gradually  roll  off  until  the  amplitude  of  the 
spectral  components  reaches  a  minimum  detectable  level.   Therefore,  most 
sampling  systems  have  a  filter  just  before  the  quantizing  device  to 
limit  the  signal  to  a  predetermined  band.   This  filter  is  called  an  anti- 
aliasing filter. 

When  analog  data  are  quantized  i.e.,  converted  from  a  continuous 
waveform  to  discrete  values,  at  a  rate  of  f   samples  per  second,  the  ana- 
log spectrum  is  divided  into  frequency  bands  of  width  f  /2  (figure  2.4a). 
During  sampling,  the  energy  in  each  band  is  folded  or  aliased  into  the 

lowest  band  (0  to  f  /2)  and  f  /2  is  called  the  folding  frequency.   The 

s         s 
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folding  operation  can  be  envisioned  if  the  bands  in  figure  2.4a  are 
viewed  from  "above",  as  in  figure  2.4b.   The  original  analog  spectrum  is 
collapsed  into  the  lowest  frequency  band,  and  the  even-numbered  bands  are 
inverted  in  frequency  as  they  are  folded.   Therefore,  the  net  result  of 
the  sampling  process  is  that,  in  spectra  computed  from  the  digital  sam- 
ples, all  energy  in  the  original  analog  spectrum  appears  in  the  band 
0  to  f  /2  as  seen  in  figure  2.4c.   Figure  2.5a   shows  an  example  of  a 
typical  aliasing  situation.   It  is  obvious  that  the  spectrum  computed 
from  the  digital  samples  (figure  2.5b)  provides  very  little  information 
concerning  the  actual  frequency  content  of  the  original  signal  because 
the  original  location  of  the  computed  spectral  components  is  unknown. 
Only  by  containing  the  original  spectrum  in  a  single  known  band  can 
the  frequency  content  be  unambiguously  determined. 

Typically  we  think  of  two  types  of  antialiasing  filters,  the 
low-pass  filter  and  the  band-pass  filter.   Both  have  application  in 
digital  signal  processing  and  will  be  briefly  discussed  here. 

The  low-pass   anti-aliasing  filter  will  be  assumed  to  have  a 
frequency  response  function  as  shown  in  figure  2.6.   The  amplitude  level 
of  the  aliased  components  is  given  by 

A  =  M, ^"  /  ^^ i^^-^  (2.13) 

a    h      -^°8-j^q2 

=  3.32  M^  log^Q(f^/f^  -  1)   db  (2.14) 

where  f   is  the  sampling  frequency,  f,  is  the  filter  cutoff  frequency,  and 
s  h 

K     is  the  filter  rolloff  slope  (db/octave).   Equation  2.14  applies  when 
the  original  analog  signal  spectrum  is  assumed  to  have  components  at 
frequencies  that  are  above  the  folding  frequency  and  equal  in  amplitude 
to  components  in  the  pass  band. 
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(a)  Effect  of  sampling  the  original  sjfialog  spectrum 


(b)  Folding  operation  viewed  from  "above". 
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(c)   Bandwidth  of   tlio    spectrira   cori;^uted  frori   the   digital 
samples. 


Figure   2.4.    Aliasing  effect    ©f   sampling  on   the   Fourier 
spectrum. 
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Figure   2.5.      Typical   afttialiassing  situation 
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f  =   campling  frequency 
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M,=  filter  rolloff  slope (db/octave) 

A  =  ajnplitude  of  aliased  component 
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Figure  2.6.   Low-pass  antialiasing  filter 
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High-quality  variable-cutoff  commercially  available  filters 
generally  have  rolloff  slopes  from  -12  to  -48  db/octave.   Sample  fre- 
quencies usually  are  approximately  three  to  four  times  the  filter  cut- 
off frequency  and  in  order  to  provide  a  margin  of  safety  from  aliasing 
effects,  an  anti-aliasing  filter  slope  of  at  least  -36  db/octave  and 
preferrably  -48  db/octave  are  desirable.   These  figures  apply  to  situa- 
tions in  which  a  variety  of  general-purpose  signal  processing  techniques 
will  be  applicable.   Less  conservative  parameters  can  be  substituted  as 
the  situation  dictates. 

A  bandpass  signal  of  bandwidth  W  and  center  frequency  f   can  be 
sampled  at  a  rate  greater  than  or  equal  to  twice  the  bandwidth  rather 
than  twice  the  highest  frequency  component,  if  appropriate  antialiasing 
filtering  is  performed.   The  bandpass  antialiasing  filter  will  be  assumed 
to  have  the  frequency  response  function  shown  in  figure  2.8.   Once  the 
sample  frequency  has  been  chosen,  the  level  of   the  components  aliased 
from  above  the  band  of  interest  (assuming  they   are  equal  in  amplitude 
to  the  passband  components)  is  given  by 

where  f  is  the  upper  filter  cutoff  frequency,  M^  is  the  filter  slope 
(db/octave),  f  is  the  sample  frequency,  and  K  is  the  band  number  (see 
figure  2.7).   The  level  of  the  components  aliased  from  below  the  band 
of  interest  is  given  by 

*s,l=-^-^'"jl^°Sio[(K-l)f3/f,  -11   db        ■      (2.16) 

where  f  and  M  are  the  lower  filter  cutoff  and  slope,  respectively. 

The  procedure  for  choosing  the  sample  frequency  in  bandpass  sit- 
uations is  not  as  straightforward  as  in  the  low-pass  case.   However,  a 
fairly  simple  iterative  procedure  is  as  follows. 
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Figure  2.7.   Sampling  of  a  bandpass  signal 
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Figure  2.8,   Bandpass  antialiasing  filter 
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1.  Define  an  effective  signal  bandwidth  W'  that  is  about  two 

to  four  times  the  actual  bandwidth  of  interest,  depending  on  the  steepness 
of  the  filter  rolloff. 

2.  Select  a  sampling  band  by  choosing  K  to  be  the  largest  in- 
teger such  that  K  <  f  /W'  +  1/2.   This  will  guarantee  that  the  signal 
band  W'  will  be  encompassed  by  a  sampling  band. 

3.  Define  the  sample  frequency  to  be 

2f 
^s=KlI72  (2.17) 

4.  Calculate  values  for  A  ,  and  A  „.   If  these  values  are  not 

sh      s£ 

acceptable,  repeat  steps  1  through  4  using  a  larger  effective  bandwidth  W'. 

5.  If  it  is  inconvenient  to  generate  the  exact  sample  frequency 
calculated  in  eq.  2,17,  it  may  be  adjusted  as  needed.   Recalculation  of 

A   and  A   will  indicate  if  a  significant  amount  of  aliasing  will  result 
from  the  adjustment. 

The  effect  of  sampling  properly  filtered  bandpass  signals  at  a 
rate  proportional  to  the  bandwidth  is  the  bandshift  the  signal  spectrum 
to  the  baseband  (figure  2.4c).   This  means  that  when  spectra  are  computed 
with  digital  samples  the  frequency  axis  must  be  labeled  in  accordance 
with  the  location  of  the  original  signal  spectrum.   This  location  is  a 
fimction  of  the  sampling  band  number  K  and  sample  rate  f  .   For  odd- 
numbered  bands,  the  frequency  axis  should  be  labeled  from  (K-l)f  /2  to 
kf  II   as  shown  in  figure  2.9a.   For  even-numbered  bands,  the  scale  runs 

from  Kf   to  Kf  /2  (figure  2.9b).   Note  that  in  the  even-numbered  bands,  it 
s      s 

is  desirable  to  rotate  the  plot  about  the  vertical  axis  and  translate  the 
frequency  axis  such  that  it  runs  from  Kf  /2  to  Kf  . 
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Figure  2.9.  Relabeling  of  frequency  axis  of  computed 
spectra 
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2.   Dynamic  Range  and  Quantization  Error 

The  dynamic  range  of  an  analog-to-digital  (A/D)  converter  can 

be  defined  as  the  ratio  between  the  maximum  and  minimum  peak  signal 

values  or  as  the  signal-to-quantization  noise  ratio,  depending  on  the 

usage  of  the  digital  samples.   The  maximum  peak  signal  range  in  db 

(figure  2.10)  is  given  by 

V 

D  =  20  log   -^^^-r:,  (2.18) 

V   /2 
max 

=  6.02(n-l)  (2.19) 

where  V    is  the  maximum  input  voltage  level  and  n  is  the  number  of 
max 

bits  (including  sign)  of  the  bipolar  A/D  converter.   For  example,  a  12- 
bit  bipolar  A/D  converter  has  a  peak  signal  range  of  about  66  db,  which 
is  compatable  with  most  analog  instrumentation  devices,  e.g.,  tape  re- 
corders. 

Quantization  noise  is  caused  by  the  stepwise  approximation  of 
the  digital  samples  to  the  original  continuous  signal  and  can  be  treated 
as  an  additive  noise  component  (figure  2.10).   The  RMS  quantization 

noise  is  given  by 

■V 
N^  =  0.29  -^  (2.20) 

Q        2^^"^ 

The  RMS  signal-to-quantization  noise  ratio  (assuming  a  sine  wave 

of  peak  amplitude  V   )  in  db  is  given  by 
*^      ^        max 

SN  =  20  log  „    max^ .  (2.21) 

"  n-1 

.29  V    /2''  ^ 
max 

2n-l 
=  20  log^^o  -Ja  ^^'^^^ 

=  20  log^Q2''"^  +  7.74  (2.23) 

=.  D  +  8  (2.24) 
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Figure  2.10.  Effects  of  quantizing  an  analog  signal 
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Therefore,  a  ].2-bit  bipolar  A/D  converter  will  have  a  signal-to-quanti- 
zation noise  ratio  of  about  74  db. 

Eqs.  2.19  and  2.24  both  yield  optimistic  numbers,  i.e.,  it  is 
usually  not  possible  to  sufficiently  control  the  amplitude  of  the  input 
signal  to  the  A/D  converter  so  that  the  full  voltage  range  is  used. 
This  means  that  the  dynamic  range  can  be  reduced  by  3  to  6  db,  depending 
on  the  variability  of  the  input  signal.   A  buffer  amplifier  prior  to 
the  A/D  converter  is  usually  needed  to  condition  the  signal  level. 
3.   Discrete  Fourier  Transform  (DFT) 

When  analyzing  signals  whose  parameters  are  known  within  broad 
limits,  one  of  the  first  operations  to  be  performed  is  some  type  of 
spectral  analysis.   This  can  be  a  single  analysis,  a  series  of  analyses 
to  indicate  the  time  dependence  of  the  spectral  components,  or  an  aver- 
aged spectral  computation  to  approximate  a  power  spectral  density  func- 
tion.  This  section  discusses  some  of  the  considerations  for  using 
digital  Fourier  Transform  techniques  to  analyze  sampled  signals. 

The  DFT  computes  a  finite  Fourier  series  to  represent  a  signal 
over  a  pre-defined  time  interval.   The  time  interval  L  is  chosen  to 
include  as  much  of  the  original  signal  as  desired  (figure  2.11a).   The 
DFT,  since  it  is  a  representation  applicable  to  periodic  signals,  treats 
the  signal  as  if  it  were  periodic  with  period  L  (figure  2.11b).   The 
discontinuities  which  may  appear  because  of  this  forced  periodicity  are 
included  in  the  representation  and  are  of  primary  concern  when  performing 
DFT  analyses. 

The  DFT  representation  is  closely  analogous  to  the  frequency 
domain  representation  for  periodic  signals  (eqs.  2.1  through  2.6);  how- 
ever, it  is  necessary  to  express  the  mathematical  operations  in  terms  of 
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Figure  2.11.  DFT  processing 
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sampled  signal  values.   It  will  be  assumed  that  the  signal  to  be  repre- 
sented or  analyzed  consists  of  a  set  of  N  equally  spaced  samples  over 
the  time  interval  L  as  follows: 

s(t.)  =  s(i-At)         i  =  0,1,2, ...,N-1  (2.25) 

where  At  is  the  time  between  samples.   The  sample  spacing  is  given  by 

At=|  =  f  (2.26) 

s 

where  f  is  the  sample  frequency.   The  representation  of  the  original 

signal  s(t)  over  the  interval  L  is 

N-1   . 

s(t)    =  A     +     y     A     cos(2TTnAft  +   9   )    ;    0    <   t    <   L  (2.27) 

o^^n  n—       — 

n=l 

where 

Af  =   1/L  (2.28) 

the  fundamental  frequency  or  "bin"  spacing.   The  amplitude  and  phase 

spectra  are 


A  =  /  a   +  b  (2.29) 

n      n     n 

e  =  -Tan"-'"(b  /a  )  (2.30) 

n  n  n 


where       ^    «  N-1 


a  =  -  ls(t.)  cos(2iTnAft.)  ;  n  =  1,2,...,N-1        (2.31) 
i=l  .      ' 

b  =  ^  y  s(t.)  sin(2TmAft.)  ;  n  =  1,2,...,N-1        (2.32) 
1=1 


The  DC  term  A  is 
o 


N-1 


4.l^(^i> 


a  =  A 
°    °   "  i=0 

The  terms  a   and  b   are  commonly  called  the  real  and  imaginary  Fourier 
n      n  o    ^ 

coefficients,  respectively. 
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When  the  signal  to  be  analyzed  by  DFT  methods  is  real,  the  am- 
plitude and  phase  spectra  have  certain  symmetrical  properties.  Mathe- 
matically, the  symmetry  is  given  by 

>/2)+(l-l/2)  =  \m)-a-l/2)    ■'    ^  =  1.2. •■•.N/2      (2-3'>) 
which  states  that  the  amplitude  spectrum  has  even  symmetry  about  the 
point  (N-l)/2.   The  phase  spectrum  exhibits  odd  symmetry  about  the 
point  (N-l)/2.  i.e. 

^(N/2)-(i-l/2)  "  ®(N/2)+(i-l/2)  '  ^  "  1,2,..., N/2      (2.35) 
In  terms  of  frequency,  the  point  (N-l)/2  can  be  expressed  as  f  /2,  the 
folding  frequency.   The  portion  of  the  computed  spectrum  between  (N-l)/2 
and  N-1  can  be  treated  as  the  negative  half  of  the  spectrum  which  would 
normally  occupy  the  band  -(N-l)/2  to  0.   For  real  signals,  only  the  values 
between  0  and  (N/2)-l  must  be  considered. 

Signal  truncation  introduces  broadband  leakage  into  the  computed 
spectra,  i.e.  seemingly  narrowband  signals  will  appear  to  occupy  more 
bandwidth  than  expected.   Severe  discontinuities  in  the  pseudoperiodic 
signal  may  be  present,  depending  on  the  relationship  of  the  signal  fre- 
quencies to  the  length  of  the  period  L  (see  figure  2.11).   This  can  most 
easily  be  illustrated  for  the  case  of  a  sinusoidal  signal.   If  the  signal 
frequency  f  is  such  that 

f   =  KAf  ;  K  =  l,2,...,(N/2)-l  (2.36) 

(f  is  harmonically  related  to  Af)  the  computed  amplitude  spectrum  will 
consist  of  a  single  line  component  (figure  2.12a).   If  the  signal  fre- 
quency is  offset  by  the  amount  Af/2,  the  spectrum  shown  in  figure  2.12b 
is  obtained.   The  apparent  spectral  width  of  this  signal  depends  more  on 
the  truncation  effects  of  the  DFT  calculation  than  on  the  actual  signal 


30 


fs(t) 


Ml- 


'    1     ■    ■   ■     ■      I f 


n 


(a)   Sine  wave  harmonically  related  to  Af 


A 


n 


Till 


n 


(b)  Sine  wave  midway  betv/een  tv/o  DPT  frequency  bins 


Figure  2.12.   DFT  amplitude  spectra 
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bandwidth.   The  solution  to  this  problem  is  to  apply  a  "smoothing" 
window  to  the  input  data  to  eliminate  or  minimize  the  discontinuities. 
Windowing  functions  will  be  discussed  in  the  next  section. 

Although  the  subtleties  of  the  DFT  implementations  are  beyond 
the  scope  of  this  discussion,  there  are  some  implementation  considera- 
tions with  which  the  analyst  must  be  acquainted.   Until  several  years 

ago,  spectriim  analysis  by  digital  techniques  was  an  extremely  time- 

2 
consuming  process,  i.e.,  computation  times  proportional  to  N  ,  where 

N  is  the  number  of  points  in  the  transform. 

Presently,  most  DFT  calculations  are  performed  with  fast  Fourier 
transform  (FFT)  algorithms  which  have  computation  times  proportional  to 
N  log  N  (N  is  assumed  to  be  a  power  of  2).   The  FFT  has  made  it  possible 
to  perform  digital  spectral  analysis  calculations  in  reasonable  lengths 
of  time,  making  on-line  digital  spectral  analyzers  a  reality.   However, 
at  least  in  small  computer  systems,  these  algorithms  are  generally  im- 
plemented in  fixed  point  arithmetic  which  is  subject  to  overflow.   Thus, 
considerable  care  must  be  taken  to  properly  scale  the  FFT  computations, 
especially  in  situations  where  the  signal  energy  falls  into  a  small  num- 
ber of  frequency  bins.- 

These  overflows  are  particularly  troublesome  because  they  gen- 
erally appear  as  spurious  components  at  unpredictable  frequencies.   The 
best  way  to  prevent  overflow  while  still  maintaining  the  dynamic  ampli- 
tude range  of  the  calculation  if  to  perform  dynamic  scaling  on  inter- 
mediate computed  values  in  the  algorithm.   All  scaling   factors  should 
be  stored  in  a  way  that  will  permit  the  computed  output  values  to  be 
calibrated  with  respect  to  the  known  input  voltages.   Experience  in- 
dicates that  this  type  of  scaling,  along  with  appropriate  windowing, 
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provides  a  dynamic  range  (ratio  of  the  peak  spectral  component  to  the 
computational  noise  floor)  of  at  least  80  db  when  implemented  on  a  com- 
puter with  16-bit  words. 
4.   Window  Functions 

In  practice  it  is  only  possible  to  obtain  finite  lengths  of 
records.   The  statistical  questions  to  be  considered  later  stem  from  the 
fact  that  it  is  necessary  to  estimate  the  accuracy  of  various  functions 
obtained  from  finite  amounts  of  data.   From  the  preceding  discussion  on 
truncation  effects  and  spectral  leakage,  it  should  be  clear  that  the 
operation  for  taking  a  finite  length  record  is  equivalent  to  multiplying 
the  actual  signal  s(t)  by  the  data  window  w(t).   The  DFT  actually  uses 
samples  from  the  function  s'(t),  which  is  given  by 

s'(t)  =   w(t)  s(t)  (2.37) 

where  s(t)  is  the  original  signal  and  w(t)  is  zero  everywhere  but  on  the 
interval  0  to  L.   In  the  frequency  domain,  eq.  2.37  becomes 

s^(f)  =  w(f)*s(f)  (2.38) 


S(g)W(f-g)dg  .    (2.39) 


where  *  denotes  convolution.   This  means  the  actual  spectrum  of  s(t) 
is  convolved  with  the  Fourier  Transform  of  the  window  function  w(t). 
The  DFT,  in  effect,  samples  this  result  at  integral  multiples  of  1/L. 
Therefore,  the  shape  of  the  window  function's  spectrum  determines  the 
degree  of  spectral  spreading  when  a  narrowband  signal  is  truncated. 

Any  reasonable  data  window  w(t)  will  produce  a  spectral  window 
W(f)  which  is  concentrated  about  f=0  but  with  side  lobes  which  damp  out 
as  f  gets  further  away  from  zero.   For  small  L,  S  (f )  may  give  a  very 
distorted  picture  of  S(f)  since  the  window  W(f-g)  will  be  wide  and  hence 
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values  of  S(g)  far  removed  from  g  =  f  will  contribute  to  S  (f)  in  the 
integral.   As  L  becomes  large  the  distortion  will  be  reduced.   Finally, 
as  L  tends  to  infinity,  the  transform  component  at  frequency  f  can  be 
fully  determined,  since  data  windows  will  tend  to  the  generalized 
function  6  as  L  tends  to  infinity.   Hence,  as  L  tends  to  infinity,  W(f-g) 
tends  to  a  delta  function  centered  at  g  =  f  and  so  S  (f)  tends  to  S(f). 
In  reality,  of  course,  this  ideal  situation  is  impossible  since  we  are 
always  dealing  with  finite-length  records. 

Although  there  are  many  different  windov^7  functions  in  the  lit- 
erature, the  advantages  of  windowing  the  data  may  be  adequately  observed 
by  noting  the  differences  between  the  rectangular  window  (i.e.  simply 
truncating  the  data)  and  the  Banning  (Cosine)  window.   It  should  be 
noted,  however,  that  the  application  of  any  data  window  is  simply  a 
tradeoff  between  the  bandwidth  of  the  main  lobe  and  the  attenuation  of 
the  side  lobes  (i.e.  a  spectral  window  with  a  narrow  main  lobe  will  have 
as  slower  rate  of  attenuation  of  the  side  lobes  than  a  window  V7ith  a 
wider  main  lobe) .   This  comparison  can  be  seen  for  the  rectangular 
window  and  the  Banning  window  in  the  next  two  sections. 

a.   Rectangular  Window 

The  rectangular  window  is  applicable  when  samples  of  the 
original  signal  are  used  directly  by  the  DFT.   This  function  is  given  by 

^0    elsewhere 
(see  figure  2.13a).   The  Fourier  transform  of  w(t)  is 

W(f)    =   LeJ2^^^     sinrrfL  ^2.41) 

TTfL 

(see    figure   2.13b   and   c) .      The   actual   location   of   the   sin   x/x  function, 
and  hence   the   amounts   of   energy    leakage,    in   relation   to   the   DFT   frequency 
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(b)  Frequency  domain  with  input  signal  harmonically 
related  to  l/L. 
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(c)  Frequency  domain  with  input  signal  half-way 
between  two  bins. 


Figure  2.13.  Rectangular  window 
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bins,  depends  on  the  frequency  of  the  input  signal  to  which  the  windov/  is 
applied.   For  an  input  signal  frequency  which  is  harmonically  related  to 
Af,  the  sin  x/x  function  is  sampled  as  shown  in  figure  2.13b  and  all  sig- 
nal energy  plus  any  other  energy  which  might  be  present  (noise)  under  the 
main  and  side  lobes  is  reported  as  being  present  at  the  input  signal  fre- 
quency.  For  a  signal  frequency  halfway  between  the  two  bins,  the  sin  x/x 
function  is  sampled  as  shown  in  figure  2.13c  and  the  signal  energy  is  at- 
tenuated and  spread  throughout  all  the  side-lobes  giving  a  very  distorted 
picture  of  the  actual  signal.   The  relative  size  of  the  side-lobe  area 
compared  with  the  main  lobe  area  is  very  important  when  considering  the 
amount  of  energy  leakage.   Figure  2.14  shows  the  spectrum  of  the  rectan- 
gular-time window  drawn  to  scale. 
b.   Manning  Window 

The  Hanning  Windov7  is  adequate  for  general-purpose  spectral 
analysis  and  is  easily  applied  to  data  either  in  the  time  or  frequency 
domain.  • 

The  Hanning  Window  is  defined  by 

1/2(1  +  cos  ^^)     -L/2  1  t  <  L/2 
W(t)  =  {  (2.42) 

0  Elsewhere 


(see  figure  2.15).   The  Fourier  Transform  of  W(t)  is 


,  ,  _  L/simrfL   ^i  sin(TTfL-TT)    1^   sin(7TfL+Tr)\ 

WU;     -     2\TTfL  2         (TTfL-TT)  2         (TTfL+TT)  } 


(2.43) 


_   L        sirnrfL 
"    2    TTfL(l-f^L^) 

From  equation  2.43  it  is  seen  that  the  Hanning  Window  turns 

out  to  be  the  summation  of  three  sin  x/x  functions.   Since  the  main  lobes 

are  spaced  1/L  Hz  apart,  this  window  has  the  effect  of  averaging  three 

adjacent  lobes,  giving  the  center  one  twice  as  much  weight  as  the  side 
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ones.   Thus,  there  will  be  a  tendency  to  greatly  decrease  the  size  of 
the  side  lobes.   The  main  lobe  is  reduced  to  one-half  of  its  previous 
value  and  its  width  is  doubled.   The  reduction  in  leakage,  therefore, 
results  in  a  corresponding  widening  of  the  bandwidth  of  analysis.   The 
distance  between  the  first  zero  crossing  on  either  side  of  the  main  lobe 
is  2/L  Hz.   Figure  2.15  shows  the  spectrum  of  the  Hanning  window  drawn 
to  scale. 

C.   PROCESSING  OF  REAL  DATA 

To  see  how  some  of  the  ideas  presented  thus  far  are  applied  to  real 
data,  consider  the  method  of  processing  being  used  in  the  new  "SPOTLIGHT" 
acoustic  signal  processor  being  developed  by  Prof,  G.  A.  Rahe  at  the 
Naval  Postgraduate  School,  Monterey. 

The  idea  behind  the  processor  is  to  present  the  observer  with  a 
continually  updated  history  of  the  spectrum  of  the  incoming  signal  and 
to  allow  him  to  interpret  the  data  appropriately.   The  display  of  the 
spectrum  is  presented  as  a  three-dimensional  random  surface  on  a  CRT 
screen  as  shown  in  figure  2.16.   Each  successive  update  of  the  display 
represents  an  estimate  of  the  incoming  signal's  spectrum  over  a  length 
of  time  T  seconds  long.   As  each  new  estimate  is  presented,  the  oldest 
one  is  dropped  off  the  back  of  the  display  and  a  new  one  appears  on 
the  front  end.   Each  spectral  estimate  is  formed  as  follows: 

1.  A  data  window  w  of  duration  L  seconds  is  applied  successively 
to  the  available  data  in  the  overlapping  intervals  (0,L),  (S,S+L),... 
[(P-1)S,T],  where  S  is  the  amount  of  shift  each  adjacent  data  window 
undergoes,  and  P  is  the  total  nioraber  of  pieces  or  segments  employed 

(see  figure  2.17) . 

2.  A  DFT  is  performed  on  each  of  the  P  windowed  sections  using 
an  FFT  algorithm  and  a  raw  estimate  is  computed  as  the  magnitude  squared 
of  the  DFT.  39 
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Pigure  2.16.   "SPOTLIGHT"  processor  display 
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Figure  2.17.   Overlapped  data  windows 
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3.   The  smoothed  estimate  is  computed  as  the  average  of  the  P 
raw  estiarates. 

Since  L  seconds  of  data  are  used  for  each  estimate  we  see  that 

(P-l)S  +  L  =  T  (2.A4) 

Each  estimate  need  not  and  most  probably  will  not  be  independent  of  the 
previous  estimate.   The  update  rate  of  the  estimates  is  a  function  of 
the  operator's  needs.   The  choice  of  the  parameters  L,  T,  and  S  is  dis- 
cussed in  the  next  section, 

D.   PRACTICAL  CONSIDERATIONS 

The  effectiveness  of  the  processor  depends  on  its  ability  to  make 
consistently  stable  spectral  estimated  using  intelligently  chosen  para- 
maters.   Some  of  the  more  practical  aspects  of  the  spectral  estimate 
are  now  discussed.   The  parameters  associated  with  making  the  estimates 
must  necessarily  be  chosen  prior  to  the  processing  of  any  data  and  should 
be  made  to  meet  certain  specifications. 

The  sampling  rate  must  be  chosen  so  that  the  spectrum  can  be  esti- 
mated in  the  frequency  range  of  interest,  0  ^  f  ^  f -i  •   Hence,  the  sam- 
pling rate  must  be  at  least  2f  .   Care  must  be  taken  to  avoid  aliasing 
as  was  discussed  in  section  IIB.   This  is  usually  done  with  an  anti- 
aliasing filter  prior  to  sampling  the  signal  so  that  the  power  above 
f-  in  the  spectrum  is  effectively  removed. 

Associated  with  reproducing  the  fine  details  of  the  spectrum  is  the 
fidelity  of  the  estimate.   To  achieve  high  fidelity,  the  "Bandwidth"  of 
the  window  used  must  be  of  the  same  order  as  the  width  of  the  narrowest 
important  detail  in  the  spectrum.   It  is,  therefore,  useful  to  be  able 
to  make  an  intelligent  estimate  as  to  the  width  of  the  smallest  impor- 
tant detail  required  by  the  system  to  be  "detected".   For  example,  if 
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the  width  of  the  narrowest  important  detail  in  the  spectrum  is  known  to 
be  .15  Hz,  then  as  spectral  window  whose  "bandwidth"  is  of  the  order  of 
.15  Hz  or  smaller  must  be  used  to  be  able  to  resolve  the  fine  details 
of  the  spectrum.   Note  that  to  resolve  a  spectrum  to  the  same  amount  of 
detail,  more  data  (a  larger  transform)  would  be  required  if  using  the 
Hanning  vjindow  than  if  using  the  rectangular  window  since  the  "bandwidth" 
of  the  Hanning  window  is  greater  than  that  for  the  rectangular  window 
(see  figures  2.14  and  2.15).   Assuming  that  the  hardware  is  available, 
one  should  not  be  lulled  into  ignoring  the  above  considerations  and 
using  the  finest  resolution  available,  because  the  requirements  for  high 
fidelity  and  high  stability  (to  be  discussed  next)  are  conflicting  and 
any  practical  realization  is  necessarily  a  compromise  between  the  two. 
It  seems  appropriate  at  this  time  to  say  that  the  frequency  "band- 
width" of  a  narrow  band-pass  filter  (window)  is  described  in  several 
different  ways.   Three  of  the  most  common  descriptions  for  bandwidth 
which  are  of  interest  to  the  power  spectral  density  measurement  are 
briefly  described  here. 

1.   The  half-power  point  bandwidth  B   is  defined  as 

r 

\  =   ^2   ~   ^1  ^^'^^^ 

where  o  o  o 

iw(f^)r  =  iw(f2)r  =  1/2  \\\  . 

B  is  the  frequency  interval  between  the  upper  and  lower  frequencies 
where  the  filter  attenuates  an  applied  signal  3  db  below  maximum  trans- 
missibility .   This  description  of  bandwidth  is  convenient  because  it  is 
easy  to  measure,  but  is  otherwise  of  little  physical  significance  for 
random  signal  applications. 
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2.  The  noise  bandwidth  B   is  defined  as 

n 

|w(f) 1^  df 
K   =  ^ 9 (2.46) 

is  the  bandwidth  of  a  hypothetical  rectangular  filter  which  would  pass  a 
signal  with  the  same  mean  square  value  as  the  actual  filter  when  the  in- 
put is  white  noise.   B  is  a  convenient  measure  of  bandwidth  to  use  for 
^  n 

normalizing  narrowband  mean  square  value  measurements,  as  required  for 
analog  pov7er  spectral  density  measurements. 

3.  The  equivalent  statistical  bandwidth  B   defined  as 

s 


r. 


.2    2 
|W(f) I   df 

B  =  -^^ (2.47) 

s 

r 

|W(f)|^  df 
0 

is  the  bandwidth  of  a  hypothetical  rectangular  filter  which  would  pass 
a  signal  with  the  same  mean  square  value  statistical  error  as  the  actual 
filter  when  the  input  is  white  noise.   In  other  words,  B   is  the  descrip- 
tion of  filter  bandwidth  which  is  technically  appropriate  when  considering 
the  statistical  error  e  of  a  properly  resolved  power-spectral  density 
measurement  as  given  by 

e  =  -^—  (2.48) 

e 

For  the  case  of  an  ideal  filter,  the  three  descriptions  of  bandwidth 
are  all  equal.   For  any  other  filter  characteristic,  however,  the  band- 
width terms  will  generally  be  different.   A  comparison  of  these  three 
definitions  of  bandwidth  for  the  rectangular  and  Banning  data  windows 
is  seen  below  in  table  2.1. 
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Data  Window 

B 

r 

B 

n 

B 
s 

Rectangular 
Banning 

.85 
L 

1.4 

L 

1 
L 

1.5 

L 

1.23 
L 

2.08 
L 

Table  2.1.   Comparison  of  Bandwidth  Definitions 

For  finite  records,  the  extent  to  which  the  width  of  peaks  can  be 
estimated  and  fine  detain  detected  is  also  influenced  by  the  variance 
of  the  estimate.   Hence,  to  be  able  to  trust  the  fine  structure  of  the 
spectrum,  it  must  be  possible  to  tie  down  the  estimate  to  a  given 
stability.   The  estimator  is  said  to  have  high  stability  if  the  variance 
of  the  smoothed  spectral  estimate  is  small.   A  convenient  description  of 
a  spectral  estimate  is  given  by  its  "equivalent  number  of  degrees  of 
freedom",  K,  defined  (Blackman  and  Tukey,  1968)  as 

2 


^  ^   2 (Average) 
Variance 


(2.49) 


Since  a  small  variance  is  associated  with  a  stable  estimate,  the  larger 
the  equivalent  number  of  degrees  of  freedom  the  more  stable  the  estimate, 
It  will  be  shown  in  section  III  that  maximum  stability  is  obtained  by 
overlapping  the  data  windows  as  shown  in  figure  2.17.   The  amount  of 
overlap  used  to  achieve  the  maximum  stability  is  dependent  upon  the  par- 
ticular data  window  used.   For  a  fixed  transform  length  L,  windows  with 
lower  sidelobes  have  wider  and  smoother  main  lobes  and  therefore  require 
less  overlap  to  achieve  maximum  stability. 

In  addition  to  the  fidelity  and  stability  of  the  estimated  spectrum, 
there  are  some  other  important  considerations  to  be  made  when  choosing 
the  parameters  to  be  used  in  making  the  estimates.   For  instance,  the 
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statistics  of  the  background  must  be  a  factor  in  determining  the  length 
of  time,  T,  used  in  the  smooth  estimates.   A  particular  advantage  to 
this  method  of  obtaining  the  spectral  estimates  is  that  L  may  be  chosen 
so  that  slow  variations  in  the  background  do  not  affect  the  performance 
of  the  processor.   If  necessary,  the  frequency  resolution  of  the  esti- 
mate may  be  adjusted  by  appending  zeros  to  the  data  before  taking  the 
transforms. 

Additionally,  if  the  desired  signal  is  unstable  -  i.e.,  fading  in 
and  out  due  to  the  effects  of  the  propagation  medium,  propagation  paths, 
or  just  being  transmitted  intermittently  -  and  therefore  not  present  for 
at  least  lengths  of  time  equal  to  T,  then  it  is  possible  that  the  inte- 
gration process  will  actually  attenuate  the  signal  rather  than  enhance 
it.   Redundancy  is  the  display  will  help  alleviate  the  intermittent 
transmission  problem  by  providing  the  operator  with  estimates  which  are 
not  totally  independent,  but  fade-in  and  fade-out  problems  due  to  propa- 
gation should  be  considered  in  choosing  the  integration  time  T. 
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III.   THE  SPECTRAL  ESTIMATE 

In  the  development  of  modern  communication  theory,  a  vast  body  of 
knowledge  involving  various  technical  disciplines  has  been  incorporated. 
The  area  of  digital  data  analysis  and  measurement,  for  example,  Includes 
the  widespread  use  of  spectral  analysis  techniques  made  feasible  by  the 
availability  of  the  fast  Fourier  transform  algorithm.   Increased  emphasis 
on  optimization  and  detection  techniques  also  necessitates  the  use  of 
modern  statistical  procedures.   The  very  foundation  of  communication 
engineering  includes  such  topics  as  probability  theory,  random  processes, 
and  information  theory.   This  section  will  draw  on  what  is  necessary  from 
the  theoretical  framework  of  modern  communication  theory  to  discuss  the 
problem  of  estimating  the  power  spectra  of  finite  length  records  and 
determining  their  confidence  level. 

A.   STATIONARY  STOCHASTIC  PROCESSES  AND  ERGODICITY 

A  process  is  termed  deterministic  if  it  does  not  contain  any  features 
of  randomness;  otherwise  it  is  termed  stochastic  (or  random).   Spectral 
analysis  is  concerned  mainly  with  a  class  of  stochastic  processes  called 
stationary  stochastic  processes.   A  stationary  stochastic  process  is  one 
whose  statistical  properties  do  not  change  with  time.   Signals  generated 
by  a  stationary  process  have  the  property  that  a  long  enough  segment  of 
any  signal  recorded  in  some  time  period  has  essentially  the  same  statis- 
tics as  another  segment  of  the  signal  observed  at  some  other  period. 
That  is,  a  signal  from  a  stationary  process  is  not  tied  down  to  any  par- 
ticular origin  in  time. 

There  are  several  definitions  of  stationarity  in  current  use.   A 
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process  is  said  to  be  stationary  in  the  strict  sense  if  none  of  the 
statistics  which  characterize  the  stochastic  process  changes  with  time. 
Another  definition  of  stationarity  is  wide-sense  or  weak  stationarity . 
A  process  is  said  to  be  weakly  stationary  if  the  mean  value  and  auto- 
correlation function  do  not  vary  with  time.   In  other  words,  for  the 
weakly  stationary  random  process,  the  mean  value  is  a  constant  and  the 
auto-correlation  function  is  dependent  only  upon  the  displacement  t  and 
not  on  the  particular  time  chosen  to  compute  the  auto-correlation. 

If  each  signal  of  the  ensemble  of  possible  signals  that  can  be  gen- 
erated by  a  stationary  process  is  typical  of  the  ensemble  as  a  whole, 
then  the  process  is  said  to  be  ergodic.   The  statistics  over  a  long  time 
interval  for  any  one  signal  are  then  the  same  as  the  statistics  over  the 
ensemble  of  all  possible  signals  at  any  one  time  instant.   Thus  an  ergod- 
ic process  is  one  for  which  corresponding  ensemble  averages  and  time 
averages  are  equal. 

The  class  of  ergodic  processes  is  a  subclass  of  the  class  of  station- 
ary processes.   That  is,  an  ergodic  process  is  necessarily  stationary  but 
a  stationary  process  need  not  be  ergodic.   The  application  of  theory  to 
practice  is  very  oftei)  simplified  by  making  the  assumption  that  a  pro- 
cess is  ergodic;  however,  except  in  special  instances  it  would  be  diffi- 
cult or  impossible  to  demonstrate  that  a  physical  process  is  ergodic 
other  than  by  direct  measurements  and  comparisons. 

B.   DEFINITION 

The  following  discussion  will  examine  the  definition  of  the  spectral 
estimate,  the  error  involved  in  trying  to  make  a  spectral  estimate  from  a 
finite  length  sample,  and  v;ays  of  improving  this  inherent  error.   To  avoid 
confusion,  the  terms  "spectral  estimate"  or  "estimate"  in  this  discussion 
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refer  consistently  to  the  estimate  of  each  individual  spectral  component 
within  the  bandwidth  of  interest,  B;  and  the  term  "spectrum"  will  be  used 
to  refer  to  all  the  individual  estimates  collectively. 

In  order  to  proceed  with  estimating  the  spectrum  of  ocean  ambient 
noise,  some  assumptions  about  the  properties  of  the  noise  will  have  to  be 
made.   It  will  be  assumed  in  this  discussion  that  the  ocean  ambient  noise 
may  be  considered  approximately  Gaussian  and  at  least  weakly  stationary. 
Some  recent  research  has  shown  that  these  assumptions  are  reasonable 
(Parkins,  1968). 

In  the  context  of  digital  signal  processing,  the  power  spectral  den- 
sity or  power  spectrum  of  a  signal  is  defined  as  the  magnitude  squared  of 
the  Fourier  transform  of  the  signal.   Inherent  in  the  definition  of  the 
Fourier  transform,  however,  is  the  fact  that  a  signal  of  finite  length 
will  have  a  transform  which  contains  all  frequencies  and,  conversely, 
a  band-limited  spectrum  emanates  from  a  signal  which  extends  from  t  =  -  °° 
to  t  =  +  oo  .   Therefore,  in  order  to  talk  about  a  finite  spectrum,  the 
definition  of  the  true  power  spectrum  of  a  record  n(t)  is  defined  as 


G(f)  =  lim  |e 

T-H=o 


iN(f,T)|^  (3.1) 


where  N(f,T)  is  the  finite  Fourier  transform  of  n(t),  that  is, 

T         _.2TTft 

N(f,T)  =  r  n(t)  e  ^     dt  (3.2) 

0 

An  estimate  of  the  true  spectrum  given  in  Equation  3.1  for  a  sample 

record  of  any  length  T  may  be  obtained  by  simply  omitting  the  limiting 

and  expectation  operators.   This  will  yield  an  estimated  spectrum  of 

G(f,T)  =  I  |N(f,T)|^  b.3) 
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where  the  frequency  resolution  will  be  — .   Tliis  estimate,  as  it  stands, 
will  be  shown  to  be  statistically  unreliable,  however.   Methods  of  improv- 
ing the  reliability  will  be  presented  in  Section  HIE. 

C.   STATISTICS  OF  THE  SPECTRAL  ESTIMATE 

As  we  discussed  in  Section  II,  N(f,T)  is  a  complex  quantity  composed 
of  the  real  and  imaginary  Fourier  coefficients  and  |N(f,T)|   is  the  sum 
of  the  squares  of  the  Fourier  coefficients.   Now,  since  n(t)  is  consider- 
ed to  be  a  Gaussian  random  process  with  zero  mean,  and  since  the  Fourier 
transform  is  a  linear  operation,  it  can  easily  be  shown  (see  Section  IV) 
that  the  Fourier  coefficients  are  independent  Gaussian  random  variables 
with  zero  mean  and  equal  variances. 

Blackman  and  Tukey  (Pg  21)  show  that  each  frequency  estimate  of  G(f) 
is  a  chi-square  variable  with  two  "degrees  of  freedom".   The  number  of 
degrees  of  freedom,  K,  may  be  thought  of  as  representing  the  number  of 
independent  or  "free"  squares  entering  into  the  estimate.   Note  that  Equa- 
tion 3.3  describes  the  estimated  spectrum  and  that  each  spectral  compon- 
ent is  a  chi-square  variable  with  two  degrees  of  freedom.   The  probability 
density  function  of  \/'     is  given  by 

P  ?  (x)  =  [2^^'^   r(k/2)]  "^  x(k/2-l)  ^-x/2  ^3^^^ 

•  ^k 

where  r(a)  =   x^""*"  e"""  dx  (3.5) 

The  mean  and  variance  of  the  variable  y,   are 

k 

1^. 


V^MXu']  =  °2    =  2k  (3.7) 

^k 
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Note  that  the  variance  of  the  spectral  estimate  is  independent  of  the 
record  length  T.   That  is,  increasing  the  record  length  does  not  reduce 
the  variance  of  the  estimate;  it  only  increases  the  number  of  spectral 
components  in  the  spectrum.  Since  the  record  length  is  a  measure  of  the 
sample  size  for  the  estimate,  this  implies  that  the  PSD  estimates  given 
by  Equation  3.3  are  inconsistent .   To  be  consistent,  an  estimate  should 
approach  the  parameter  being  estimated  with  a  probability  approaching 
unity  as  the  sample  size  becomes  large.   That  is,  for  any  e  >  0, 

lim  Prob  [ |g  -  g|  >  e]  =  0  (3.8) 

T-K» 

Because  the  estimates  given  by  Equation  3.3  are  inconsistent,  the 

spectrum  computed  from  that  equation  is  called  the  "raw  spectrum"  and  the 

components  of  the  spectrum  are  called  the  "raw"  components.   It  will  be 

shown  that  more  nearly  consistent  estimates  can  be  made  by  "smoothing" 

the  raw  estimates. 

D.   ERROR  ASSOCIATED  WITH  THE  SPECTRAL  ESTIMATE 

The  accuracy  of  parameter  estimates  based  upon  sample  values  is  usu- 
ally described  in  terms  of  the  mean  square  error  defined  by 

Mean  square  error  =  E  [(G  -  G)^]  (3.9) 

It  can  easily  be  shown  that  the  mean  square  error  is  made  up  of  two 
parts;  the  variance  of  the  estimate  (random  error)  and  the  square  of  the 
bias  error  of  the  estimate,  that  is. 

Mean  square  error  =  Var  [G]  +  b^[G]  (3.10) 

An  estimate  is  said  to  be  unbiased  if  ElG]  =  G.   Therefore,  the  bias 
error  of  the  estimate  is  defined  as 

Bias  error   =  b[G]  =  E[G]  -  G  (3.11) 
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In  Bendot  and  Piersol  (Pg  187),  the  bias  error  for  the  spectral  esti- 
mate is  shown  to  be  proportional  to  the  second  derivative  of  the  true 
spectrum .   Therefore,  for  a  relatively  smooth  spectrum  over  the  effective 
bandwidth  B  of  each  estimate,  the  bias  error  is  usually  negligible  com- 
pared with  the  random  portion  of  the  error,  VAR[G] .   The  mean  square 
error  is  thus  approximately  equal  to  the  square  of  the  standard  error 
where  the  standard  error  is  defined  as 

Standard  error  =  VAR[G]  =  o[G]  (3.12) 

If  this  error  is  expressed  as  a  fraction  of  the  quantity  being  esti- 
mated, it  is  called  the  normalized  standard  error,  r.  ,  and  is  given  by 

e^=  °i|i  (3.13) 


Remembering  that 


and 


E  [x/]=  E[G]  =  k  -  G  (3.14) 


VAR  [Xj^^]  =  VAR  [G]  =  2k  (3.15) 


we  have 


e  =  /  r  (3.16) 

r     k 

And  since  K  =  2  for  the  raw  spectral  estimate,  we  have  a  standard  error 
of  unity,  which  says  that  the  standard  deviation  of  the  estimate  is  as 
large  as  the  quantity  being  estimated  —  which  is  obviously  unacceptable. 
The  smoothing  operations  presented  in  the  next  section  will  increase  the 
number  of  degrees  of  freedom  associated  with  each  estimate  and  thus  de- 
crease the  standard  error  considerably. 

E.   SMOOTHING  THE  SPECTRAL  ESTIMATE 

The  method  of  reducing  this  error  is  called  "smoothing"  and  may  be 
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accomplished  in  one  of  two  ways.   The  first  way  is  to  smooth  over  an 
ensemble  of  estimates.   This  can  be  done  by  computing  individual  esti- 
mates from  q  sample  records  and  then  averaging  the  q  estimates  at  each 
frequency.   The  second  way  is  to  smooth  over  frequency.   This  can  be  done 
by  averaging  together  the  results  of  Jl  contiguous  spectral  components  in 
the  estimate  from  a  single  sample  record.   In  either  case,  the  smoothing 
operation  approximates  the  expectation  operation  in  equation  3.1  and 
hence  makes  the  estimates  consistent. 

For  ensemble  smoothing,  the  estimates  from  q  independent  (i.e.,  non- 
overlapping)  time  slices  are  averaged  together.   If  each  time  slice  is  of 
length  T' ,  then  the  length  of  the  original  sample  is  T  =  qT'  and  the  final 
smoothed  estimate  is  given  by 

G.  =  -  [  G.  ,  +  G.  _  H-  .  ..  +  G.   ]  (3.17) 

1    q    1,1     1,2  i,q 

where  G.    is  the  raw  estimate  at  frequency  f.  of  the  q    time  slice. 
i,q  M     7   ^        n 

Noting  that  Y   +  Xi   =  X  .,  we  see  that  the  smoothed  estimate  will  be  a 
a     b     a+b 

chi-squared  variable  with  roughly  2q  degrees  of  freedom.   The  smoothing 

operation  did  not  affect  the  bandwidth  of  the  estimates;  hence  each  esti- 

ot 
mate  will  have  an  effective  bandwidth  of  B  =  —  ,  where  a  depends  on  the 

e   T 

particular  window  being  used.   Hence,  by  ensemble  averaging 

B^  =  f.  =  ^  (3.18) 

and 

^.'/i-/\  ".19) 

The  filter  shape  of  each  smooth  estimate  will  be  the  same  as  for  the 
raw  estimate  which  is  determined  by  the  Fourier  transform  of  the  original 
data  window  used.   The  estimate  G.  is  considered  as  representing  the 
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midpoint  of  the  frequency  interval  covered  by  the  filter  (B   is  the  effec- 
tive bandwidth  of  the  filter).   Figure  3.1  illustrates  the  process  of  ob- 
taining smooth  spectral  estimates  using  ensemble  averaging. 

For  frequency  smoothing,  the  estimates  of  i    contiguous  spectral  com- 
ponents from  the  raw  spectrum  are  averaged  to  form  the  smooth  estimate 
(the  raw  spectrum  referred  to  was  computed  from  a  time  record  of  length 
T) .   Hence,  the  i   spectral  component  G.  of  the  smoothed  spectrum  will 
be  given  by 

■•2  2   J 

where  G.  is  the  i    component  of  the  raw  estimate  and  it  is  assumed  that 

£  is  odd.   G,  will  be  a  chi-squared  variable  with  roughly  2Z  degrees  of 
freedom  by  the  same  reasoning  used  with  ensemble  smoothing.   The  effec- 
tive bandwidth,  B'  ,  of  the  smoothed  estimate  will  depend  on  the  value  of 
i   and  will  be  given  by 

B'   =  ^  "^  "^"^  (3.21) 

e       i 

where  the  effective  bandwidth  of  each  raw  estimate  used  in  the  average 
was  B  =  a/T.   Again  the  value  of  a  depends  on  the  data  window  used.   The 
filter  shape  of  the  smoothed  estimate  will  be  approximately  trapezoidal 
as  shown  in  figure  3.2.   The  process  of  obtaining  a  smooth  estimate 
through  frequency  averaging  is  shown  in  figure  3.3.   The  estimate  G.  is 
considered  as  representing  the  midpoint  of  the  frequency  interval  covered 
by  B  .   A  total  of  N/jl  independent  estimates  can  be  obtained  in  the 
smoothed  spectrum.   The  standard  error  of  the  estimates  obtained  by  fre- 
quency averaging  is  given  by 

=  /K    =  /l 
^r  /k   /  Z 
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4  n(t) 


(a)  Sample  record  segmented  into  three  pieces 

•   *  • 


(b)  Raw  spectrum  from  segment  one 


i^' 


(c)   Rav/   spectrum  from   segment   tv/o 


->-f 


iG. 


•   >   < 


(d)  Raw  spectrum  from  segment  three 


.,G  =  ^(G^+G2+G^) 


■^  f 


(e)  Smoothed  spectrum  by  ensemble  smoothing  with  q=3 


Figure  3.1.  Example  of  ensemble  smoothing 
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For  combined  smoothing  using  both  frequency  and  ensemble  averaging, 
the  final  effective  bandwidth  of  each  estimate  will  be  approximately 

^  e  ~  T' 

and  the  resulting  G.  will  be  a  chi-squared  variable  with  roughly  2jlq 
degrees  of  freedom.   The  normalized  standard  error  c   is  given  by 

e  =/=-  =    J-j-  (3.24) 

r   /  k    /  £q 

Even  though  the  stability  of  the  estimates  may  be  improved  by  either 
ensemble  or  frequency  smoothing  or  a  combination  of  the  two,  it  is  advis- 
able to  use  only  ensemble  smoothing  in  a  system  designed  to  detect  narrow- 
band signals.   The  frequency  resolution  of  the  estimated  spectrum  will 
generally  be  determined  to  be  of  the  same  order  of  magnitude  as  the  sig- 
nal to  be  detected,  and  taking  longer  transforms  and  frequency  smoothing 
would  be  a  very  inefficient  use  of  the  available  resources. 

F.   INCREASED  STABILITY  THROUGH  OVERLAPPING  SEGMENTS 

The  term  "degrees  of  freedom"  introduced  in  Section  IIIC  as  a  measure 
of  the  stability  of  the  estimate  was  said  to  refer  to  the  number  of  inde- 
pendent or  "free"  squares  that  were  summed  to  form  the  estimate.   A 
slightly  relaxed  definition  of  the  number  of  degrees  of  freedom  of  an 
estimate,  called  the  "equivalent  number  of  degrees  of  freedom",  can  be 
used  by  noting  that  for  a  chi-squared  variable 

K=  f^  (3.25) 

This  definition  of  K  will  be  used  to  see  if  the  stability  of  the  estimates 
may  be  increased  still  further  by  overlapping  the  segments  used  in  the 
ensemble  smoothing  operation. 
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initial  spectral  windov; 


>-  f 


(a)  Filter  shape  before  frequency  smoothing 


final  spectral  window 


(b)  Filter  shape  after  frequency  smoothing 


Figure  3.2.  Filter  shape  before  and  after  freq. smoothing 
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in(t) 


(a)  Sample  record  used  for  the  spectral  analysis 


G(f) 


•        « 


(b)  Raw  spectrinn  from  transform  of  n(t)  between 
t  =  0  and  t  =  T 


G(f) 


•     » 


(c)  Smoothed  estima.te  by  frequency  smoothing  with  J^=3 


Figure  3.3.  Example  of  frequency  smoothing 
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Referring  to  the  notation  of  section  2.2.5  and  figure  2.17,  we  see 
that  each  smooth  estimate  is  computed  by 


G(f)  =1  I  |G.(f) 
i=l 


(3.26) 


where  G  (f)  is  the  finite  Fourier  Transform  of  the  ith  v7indowed  section 
i 

and  P  is  the  number  of  segments  used.   That  is 


G.(f) 


^f(i-l)s 

n(t)  W[t  -  J  -    (i-l)s]e  ^  dt, 


(3.27) 


(i-l)s 


1  <  i  <  P 


The  spectral  estimate  G(f)  in  equation  3.26  is  a  random  variable. 
Its  mean  and  variance  were  computed  by  Nut  tall  (1971)  and  are  given  by 


E[G(f)]  = 


where 


J 

—00 


^    (t) 

W 


G(f  -  v) |w(v) I  dv  =  G(f) 

(P-1) 

I 
i=-(P-l) 

W(t)  W(t  -  T)dt 


W(v)   dv 


Var[G(f)]  =  G^f)  I  I      (l  -  iAi)  |  <|,^(is)  | 


(3.28) 


(3.29) 


(3.30) 


under  the  assumption  that  n(t)  is  a  Gaussian  random  process  and  that  the 

2 
frequency  resolution  of  the  spectral  window  |w(f) [   is  narrower  than  the 

finest  detail  in  the  true  spectrum  G(f). 

Equation  3.28  shows  that  the  mean  of  the  spectral  estimate  is  equal 

to  the  convolution  of  the  true  spectrum  G(f)  with  the  spectral  window 

I     |2 

|W(f) I  .   Equation  3.29  expresses  the  variance  of  the  spectral  estimate 

in  terms  of  the  number  of  pieces  P,  the  shift  S,  and  the  autocorrelation 

^     of  the  data  window.   Using  the  results  given  in  equations  3.28  and 

3.29,  we  can  compute  the  equivalent  number  of  degrees  of  freedom  of  our 
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estimate  in  terms  of  the  number  of  pieces  used  and  the  autocorrelation 
of  the  data  window  as  follows: 


K  - 


2E  [g(f)] 
Var[G(f)] 


(3.31) 


But 


2P[  W(v)   dv] 


i=-(p-l) 


w 


f     2 
<f)  (0)  =   W  (t)dt 


w 


(3.32) 


(3.33) 


K 


2P 


P-1 


I        (l-J^) 
i=-(P-l) 


6  (iS) 


V7 


(f)  (0) 
w 


(3.34) 


Notice  that  K  is  independent  of  the  value  of  frequency  f  and  true  spec- 
trum G(f).   It  depends  only  on  the  particular  data  window  W  and  the  num- 
ber of  pieces  P.   Noting  that  the  shift  S  is  uniquely  determined  as  a 
function  of  P,L,  and  T,  as   shown  in  equation  2.44,  we  have 


K  = 


2P 


P-1 

1  +  2  I  a  -b 

i=l 


*w^^  -  PTI) 


(3.35) 


^^(0) 


It  is  interesting  to  note  that  for  a  given  T/L,  a  maximum  value  of 
K  occurs  at  some  value  of  overlap  less  than  100  percent.   The  value  of 
overlap  is  a  function  of  the  particular  data  window  used  and  is  relatively 
independent  of  the  value  of  T/L.   As  described  in  section  IIC,  for 
P  =  T/L,  K  =  2P.   As  P  increases  for  a  given  T/L,  the  value  of  K  will 
increase  but  at  a  continually  decreasing  rate  because  the  overlapped 
pieces  are  progressively  more  statistically  dependent.   K  will  eventually 
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reach  a  maximum  value  and  then  decrease  slightly  as  the  amount  of  overlap 
approaches  100  percent.   A  typical  curve  of  K  versus  percent  overlap  is 
shown  in  figure  3.4.   Note  that  the  amount  of  overlap  required  to  achieve 
90  or  95  percent  of  maximum  stability  is  considerably  less  than  that  re- 
quired for  100  percent.   The  extra  computational  effort  required  for 
highly  overlapped  processing  should  be  given  serious  consideration  before 
implementing  the  maximum  value  of  K. 

The  rectangular  data  window  is  described  by 

W(t)  =  u(t)  -  u(t-L) 
where  u(t)  is  a  step  function. 

The  autocorrelation  function  as  defined  in  equation  3.30  is  computed 
to  be 

(f-d)  =  L  -  |t|        (|t|  _<  L)  (3.36) 

and 


*^=  1  -M        (Ul  <  L) 
(}.(0)    ^  L         HT|  _  W 


(3.37) 


For  the  Hanning  window 

W(t)  =  |-(1  -  cos  — ^)[u(t)  -  u(t  -  L)]  (3.38) 

thus   .         (j)(T)  =  i(L-T)(l+  4cos  ^)  +  -4-   sin^  (3.39) 

H  Z       L       lb  TT       L 

(hi  IL) 

.   (  h  I  1  L) 

Tables  3.1  -  3.4  show  how  K  varies  as  a  function  of  P  for  several 
values  of  T/L,  as  given  by  equation  3.35,  for  the  rectangular  and  Hanning 
windows . 
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50  100 

Percent  overlap 


■*-S 


Figure  3.4.   Typical  K  vs  S  curve 
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RECTANGULAR 

HANNING 

p 

S(%) 

K 

K 

4 

0 

8.0 

8.0 

5 

25 

9.09 

9.99 

6 

40 

9.47 

11.91 

7 

50 

9.80 

13.36 

8 

57.1 

9.99 

14.09 

9 

62.5 

10.05 

14.29 

10 

66.7 

10.11 

14.28 

11 

70.0 

10.15 

14.20 

12 

72.7 

10.15 

14.13 

13 

75.0 

10.16 

14.06 

14 

76.9 

10.17 

14.00 

15 

78.6 

10.16 

13.94 

16 

80.0 

10.15 

13.90 

17 

81.2 

10.15 

13.85 

18 

82.4 

10.14 

13.81 

19 

83.3 

10.14 

13.78 

20 

84.2 

10.13 

13.75 

50 

93.9 

9.98 

13.38 

100 

97.0 

9.90 

13.26 

200 

98.5 

9.86 

13.20 

300 

.  99.0 

9.85 

13.18 

400 

99.2 

9.84 

13.17 

500 

99.4 

9.83 

13.16 

TABLE  3.1 

K  vs  P  FOR  T/L  =  4 

62 


RECTANGULAR  BANNING 

S  (7o)  K  K 

16.0 

18.0 

20,0 

21.99 

23.92 

25.71 

27.27 

28.52 

29.45 

30.09 

30.49 

30.72 

30.83 

30.87 

30.87 

30.85 

30.82 

30.79 

30.30 

30.03 

29.89 

29.84 

29.81 

29.80 


8 

0 

16.0 

9 

12.5 

17.51 

10 

22.2 

18.37 

11 

30.0 

18.91 

12 

36.4 

19.32 

13 

41.7 

19.69 

14 

46.2 

20.06 

15 

50.0 

20.46 

16 

53.3 

20.76 

17 

56.2 

20.95 

18 

58.8 

21.06 

19 

61.1 

21.16 

20 

63.2 

21.25 

21 

65.0 

21.35 

22 

66.7 

21.46 

23 

68.2 

21.55 

24 

69.6 

21.61 

25 

70.8 

21.64 

50 

85.7 

21.95 

100 

92.9 

21.93 

200 

96.5 

21.87 

300 

.  97.7 

21.84 

400 

98.2 

21.83 

500 

98.6 

21.82 

- 

TABLE  3.2   K 

.  vs  P  FOR  T/L 
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RECTANGULAR  HANNING 

P  S(7o)         -       K  K 

16  0  32.0  32.0 

17  6.2  33.75  34.0 

18  11.8  35.08  36.0 

19  16.7  36.10  38.0 

20  21.1  36.89  40.0 
25  37.5  39.37  49.77 
30  48.3  41.36  57.67 
35  55.9  42.87  62.07 
40  61.5  43.49  63.70 

45  65.9  44.05  64.06 

46  66.7  44.19  64.070 

47  67.4  44.31  64.074 

48  68.1  44.40  64.073 

49  68.7  44.48  64.067 

50  69.4  44.55  64.06 
100  84.8  45.66  63.59 
200  92.5  45.85  63.30 
300  95.0  45.85  63.20 
400  96.2  45.84  63.15 
500          .   97.0  45.83  63.12 


TABLE  3.3   K  vs  P  FOR  T/L  =  16 
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RECTANGULAR  MANNING 

S(%)  K  K 

64.0 

66.0 

68,0 

70.0 

72.0 

74.0 

76.0 

78.0 

80.0 

99.61 

115.85 

125.43 

129.31 

130.36 

130.52 

130.53 

130.53 

130.52 

130,52 

130.52 

130.50 

130.45 

130.40 

130.35 

130.30 

130.11 

129.92 

129.82 

129.76 


32 


32 

0 

64.0 

33 

3.1 

65.88 

34 

6.1 

67.52 

35 

8.8 

68,96 

36 

11.4 

70,22 

37 

13.9 

71.32 

38 

16.2 

72.30 

39 

18.4 

73.16 

40 

20.5 

73.93 

50 

36.7 

79.08 

60 

47.5 

83.16 

70 

55.1 

86.53 

80 

60.8 

87.94 

90 

65.2 

89.11 

100 

68.7 

90.29 

101 

69.0 

■90.37 

102 

69.3 

90.43 

103 

69.6 

90.49 

104 

69.9 

90.55 

105 

70.2 

90.61 

110 

71.6 

90.84 

120 

74.2 

91.29 

130 

76.0 

91.82 

140 

77.7 

92.08 

150 

-  79.2 

92.29 

200 

84.4 

93.07 

300 

89.6 

93.55 

400 

92,2 

93.70 

500 

93.8 

93,75 

TABLE  3.4 

K  vs  P  FOR  T/L 
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G.   CONFIDENCE  INTERVAL  FOR  THE  SPECTRAL  ESTII-IATE 

Since  the  sampling  distribution  of  the  smoothed  estimate  is  approxi- 
mately chi-square  with  K  degrees  of  freedom,  we  have  a  (1  -  a)  confidence 
interval  for  a  power  spectral  density  function  G(f)  (based  upon  an  esti- 
mate G(f)  measured  with  a  resolution  bandwidth  B   and  a  record  length  T) 
given  by 


KG(f) 
2 
%:  a/2 


<  G(f)  < 


KG(f) 


^K;  1  -  a/2 


(3.41) 


The  confidence  statement  is  interpreted  as  follows:   The  true  PSD  G(f) 
will  fall  within  the  noted  interval  with  a  confidence  coefficient  of 
(1  -  a),  or  in  more  common  terminology,  with  a  confidence  of  100(1  -  a) 
percent. 
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IV.   PROCESSING  GAIN  OF  A  FOURIER  PROCESSOR  WITH  INTEGRATION 

A.   INTRODUCTION 

Section  III  discussed  the  spectral  estimate  by  the  use  of  the  fast 
Fourier  transform  technique  and  the  method  of  improving  the  estimates  for 
a  given  record  of  finite  length.   It  was  shown  that  the  measure  of 
quality  or  reliability  of  the  estimate  is  the  mean  square  error  (or  vari- 
ance) and  that  this  error  can  be  made  arbitrarily  small  if  a  longer' 
record  were  available.   This  situation  often  creates  a  three-way  com- 
promise among  effort,  resolution  and  stability  (precision)  of  the 
estimate. 

This  section  will  interpret  the  stability  of  the  spectral  estimates 
in  terms  of  processing  gain  within  the  context  of  a  Fourier  signal  pro- 
cessor.  Figure  4.1  shows  a  model  of  the  system.   Its  mission  is  to  de- 
tect the  presence  in  a  random  noise  background  of  discrete  narrowband 
signals  with  random  phase  variation.   The  decision  element  of  the  syetem 
is  assumed  to  be  a  human  operator  monitoring  a  display  similar  to  the 
one  discussed  in  section  II  c.   The  sensitivity  of  the  processor  is  us- 
ually defined  in  terms  of  a  minimum  detectable  signal  for  a  given  pro- 
bability of  detection  and  false  alarm.   Because  of  the  presence  of  the 
human  operator  in  the  system,  the  level  of  minimum  detectable  signal  is 
difficult  to  determine  and  will  not  be  discussed.   A  brief  look  at  the 
theory  of  signal  detection  by  a  human  operator  will  be  coverd  in  section 
V.   It  should  be  noted  that  the  "processing  gain"  discussed  in  this  sec- 
tion is  derived  under  rather  idealized  conditions.   The  effects  of 
redundency  and  Hanning  weighting  are  considered  but  losses  due  to  hard- 
ware limitations  are  not  discussed.   Losses  due  to  actual  implementation 
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Figure  4.1.  Fourier  processor  with  integration 
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have  been  found  to  be  small  if  careful  consideration  is  given  to  the 
areas  discussed  in  section  II. 

B.   PROCESSING  GAIN  DEFINED 

When  talking  about  processing  gain  of  a  Fourier  signal  processor,  it 
is  convenient  to  view  the  DFT  as  a  bank  of  N  independent  narrow  band 
filters,  where  N  is  the  number  of  equally  spaced  samples  (each  of  width  A) 
within  the  time  of  L  the  data  to  be  transformed.   The  center  of  each 
"filter"  will  be  located  at  multiples  of  the  fundamental  frequency 
f  =  —  Hz  and  the  shape  of  each  filter  will  depend  upon  the  data  window 
used  as  previously  discussed.   Regardless  of  which  windoV'Z  is  used,  however, 
maximxim  power  transfer  will  occur  only  if  the  desired  signal  if  a  multiple 
of  the  fundamental  frequency.   If  the  signal  is  not  an  even  multiple  of 
f  ,  then  it  will  be  attenuated  and  its  energy  spread  throughout  the 
spectrum  (the  extent  of  the  leakage  will  again  depend  oii  which  window 
is  used)  as  shown  in  figure  2.12. 

The  processing  gain  (PG)  of  a  digital  processor  which  utilizes  the 
DFT  as  the  initial  filtering  device  and  then  incoherently  integrates 
the  output  of  each  filter  (i.e.,  ensemble  smoothing)  will  be  made  up  of 
two  distinct  parts,  one  part  due  to  the  DFT  itself  (PG  )  and  one  part  due 
to  the  incoherent  integration  (PG  ) .   Heuristically ,  this  makes  sense, 
since  the  signal-to-noise  power  ratio  (SNR)  will  obviously  be  improved 
by  passing  the  signal  through  a  narrowband  filter.   And  then  the  results 
can  be  improved  even  more  by  averaging  the  random  noise  power  over  time 
since  the  average  signal  power  is  approximately  constant.   Therefore, 
the  processing  gain  for  the  system  is  defined  as 

PG  =  PG   •  PG  (A.l) 
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For  simplicity,  the  processing  gain  will  be  derived  in  terms  of  a 
single  discrete  frequency  component  in  an  additive  vhite  gaussian  noise 
background.   The  assumption  of  a  white  noise  background  is  really  not 
very  restrictive  since  all  that  is  required  is  that  the  power  spectrum 
of  the  noise  be  flat  over  the  bandwidth  of  each  filter.   Further,  it 
will  be  assumed  that  the  signal  to  be  "detected"  will  be  an  integer 
multiple  of  the  fundamental  frequency,  f  =  1/L,  so  that  maximum  power 
transfer  is  achieved  by  the  filter.   This  is  a  reasonable  assumption 
since  we  are  interested  in  the  maximum  gain  available  from  the  system. 
Also,  it  will  be  assumed  that  the  signal  is  stable  in  frequency  through- 
out the  interval  of  observation  and  that  it  is  actually  present  in  the 
background  during  that  time.   Exceptions  to  the  above  assumptions  will 
be  discussed  later. 

The  processing  gain  of  each  section  of  the  processor  will  be  de- 
fined as  the  ratio  of  the  signal-to-noise  power  at  the  output  of  the 
section  to  the  signal-to-noise  power  at  the   input  of  the  section.   For 
standardization,  the  noise  power  used  is  usually  the  average  power  of 
the  noise  in  a  1  Hz  band.   Thus  for  the  j    filter  of  the  DFT  (see 
figure  A. 2), 

SNR^ 

(4.2) 


SNR^ 
SNRj^ 

. 

.  ^''t.j 

"~ 

and 

for 

the 

integrat( 

Dr 

SNR, 
4 

SNR^ 

'\i 

^ 

(4.3) 

Notice  that  the  signal-to  noise  ratio  at  the  output  of  the  j    filter 
is  the  input  signal-to-noise  ratio  to  the  integrator  (i.e.,  SNR  =  SNR  ) 

PG„  .-PG^  .  =  7—-^  .  -TTT^  =  T^TTVT^  =  PG,  (4.4) 


T,j    I,j    SNR^    SNR^    SNR^     j 


the  processing  gain  for  the  entire  system. 
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INPUT 


SNR^  SNR^ 


INTEGRATE 


SNR, 


OUTPUT 


Figure  4.2.  Reference  for  signal-to-noise  ratios 
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For  purposes  of  the  following  analysis,  only  the  processing  gain  of 
a  typical  filter  will  be  derived.   Results  for  the  remaining  N-1  filters 
of  the  processor  may  be  generalized  from  this  one  by  considering  v;hether 
or  not  a  signal  is  present  within  the  bandwidth  of  the  filter. 


C.   PROCESSING  GAIN  OF  THE  DFT  (PG   .) 

■*-  >  J 

The  j        filter   output   of   the   DFT   is    defined   as 


1   \-'-         -i2TTJk/N  ,,     ^, 

k=0 


3        N  ,^.   k 


where  s  ,  k  =  0,1,2 ,... ,N-1  are  the  N  sample  values  of  the  input  signal 
to  be  transformed. 

With  noise  only  present  at  the  input,  the  output  of  the  j    filter 
may  be  written  as 

c.  =  N  Z  n  e  (A. 6) 

J    "^  k=0   '^ 

where  n,  ,  k  =  0, 1,2 , . . . ,N-1  are  sample  values  from  a  zero  mean  Gaussian 

random  process.   In  terms  of  the  real  and  imaginary  coefficients,  the 

complex  quantity  c.  can  be  written  as 

c.  =  a.  +  i  b.  (4.7) 

J    J      J 


where 


N-1 


•  a.  =  Re(c.)  =  ^  I  n  cos(27TJk/N)  (A. 8a) 

_   N-1 
b.  =  Im(c.)  =^     I     n      sin(27TJk/N)  (4.8b) 

Now  since  the  DFT  is  a  linear  operator,  the  random  variables  a.  and  b. 

are  each  zero-mean  Gaussian  random  variables  when  the  input  signal  is 

noise  alone.   To  show  that  a.  and  b.  are  also  independent,  we  need  only 

3  3 

to  show  that  E[a.b.]  =  0. 
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Now 

N-1  .    N-1 


E[a.b.]    =   E[(|     5;      n.     cos    2TTJk/N)  (-  -     I      n      sin    2kj^/N)]       (4.9) 
J    J  k=0  1^0 

N-1   N-1 
=  --  I        I   E[n  n    ]    cos(2^jk/N)    sin(27TJ£/N)  (4.10) 


1  N-1  7 

-„        5;      E[n,     ]    cos(2TTJk/N)    sin(27TJk/N)  (4.11) 

N^  k=0  ^ 

2  N-1 
^       j;     sin(47rjk/N)      .  (4.12) 

N  k=0 


E[a.b.]    =   0  (4.13) 

2 

where   a      is    the   variance    (average   power)    of   the   input   noise   signal. 

Therefore,    a.    and  b.    are   independent    zero-mean   Gaussian   random  variables. 
J  J 

They   also  have   equal  variances    as   shown  below: 


2  ,    N-1  N-1 

Var[a.]    =  E[a.    ]    =   E[(-     I  cos   27TJk/N)  (-     I     n      cos    2TTJJi/N)]         (4.14) 

J  J  ^   k=0      "^  ^    Z=:0      ^ 

N-1  N-1 
=   E[-  5;        5;     n  n      cos    (2Tijk/N)    cos    (2Trj£/N)]  (4.15) 

N       k=0   £=0      ^  ^ 

T        N-1  ■ 
=  -.        5;     E[n,     ]    cos    (27TJk/N)  (4.16) 

N       k=0  ^ 

2      N-1 
=  -^       I      1  +  cos(4Trjk/N)  (4.17) 

2N        k=0 


And 


2 

a 

2N 

N-1  ,    N-1 


(4.18) 


Var[b.]    =   E[b^J    =   E[(|     I     n,     sin   27TJk/N)  (^     I     n.    sin   27TJil/N)]         (4.19) 
J  J  ^  k=0      ^  ^   £=0      ^ 
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N-1 
=  -    I      E[n      ]    sin  (2TTjk/N)  (A.  20) 

N   k=0    ^ 

2      N-1 
=  -~       I      1  -  cos  (27TJk/N)  (4.21) 

2N   k=0 

=15  (A.22) 

Now  the  power  spectral  component  of  the  j    filter  is  defined  as 

G.  =  a,^  +  b.^  .  (4.23) 

3  3  3 

and  the  average  power  in  that  bin  is  therefore  given  by 

Avg  Pwr  =  E[G.]  =  E[a.^]  +  E[b.^]  (4.24) 


2 

a 


(4.25) 


Now  if  a  signal  is  present,  the  sample  values  s  ,  k  =  0, 1,2 , . . . ,N-1 

K. 

are  each  the  sum  of  a  zero  mean  Gaussian  random  variable  and   a  constant 

t"  In 

A  cos    (2TrfkA  +  tj))  .      So   the  j         filter   output    can  be  written   as 

N-1 
^-    ="  h     I      K   +  ^  cos    (27TfkA  +   (^)]e~^^^^^'^^  (4.26) 

J        ^  k=0        ^ 

N-1  N-1 

1     r  -i27T'ik/N    ,    1     v      »  /n    ^i  a    ,    .%    -i2TTjk/N  .,    _^. 

C.    =  —     2,     1^1,^  +  M     Z,      ^  cos    (27ffkA  +  (f))e  "^  (4.27) 

J   ^  k=0  ^        ■   ^  k=0 


which  can  be  seen  to  be  the  sura  of  two  independent  parts,  one  due  only  to 
the  noise  and  one  due  only  to  the  signal.   The  noise  portion  was  handled 
above.   Consider  now  only  the  portion  of  the  output  due  to  the  signal. 
Then, 

^  Y   cos    (27rfkA  +  (j>)e-^2^J^/^  (4.28) 


=  -  y 


N-1 

I    I 
k=0 


A     V        i(27rfkA  +  (j))      -i(27TfkA  +  <^).   -i2  tt  jk/N      (4.29) 
'2N     ^ 
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Now  under  the  assumption  that  the  signal  is  in  the  center  of  the  bin  we 
have  f A  =  j/N.  So 


C.    .  ^Y   e'U  e-'^'^^'^/''  ^  ^^  (4.30) 

J>s   2N  ,^^ 


A  id) 
=  2  ^ 

The   total   output    of   the  j        bin  with  signal  plus   noise   at   the   input    is 

thus   given  by 


^  A     i(J)  _^   1      r.-*-  -i2TTJk/N  ,,    ^_ 


N-1 
=    (—  cos 


—  COS    (t  +  —     y     n,     cos   2TTJk/N,       .  ,A      ,       ^    ,    1      V  .      ^    -i   /m\ 

2  ^        N  ^^Q     k  -"    '    )    -x(-^  s±n    (^  +  -     I     n^  sm   2Trjk/N) 

k=0 

(4.33) 

The  power  spectral  component  at  frequency  j/N  is  given  by 

G.  =  |C. 1^  (4.34) 

^2  N-1  .  N-1  N-1 

^n   =   4  "^  N  '^^^    *   5^  n,  (2Trjk/N)   +  -  sin    <}.   j;  n      sin(2Trjk/N)   +  -       J  n 
^        ^  k=0  '^  ^  k=0  ^  N     k=0   ^ 

(4.35) 
and  the   average  power  at    that    frequency   is   given  by 

Avg  Pwr  =   E[G.]  (4.36) 

,2  2 

since  E[N  J  =  0.   Notice  that  the  average  power  in  the  j    bin  is  made  up 
of  two  components,  one  due  only  to  the  noise  and  one  due  only  to  the  sig- 
nal.  This  is  not  an  unexpected  result  since  the  noise  and  the  signal  were 
uncorrelated  and  therefore,  the  power  should  be  additive. 


75 


Referring  to  equation   4.2,   we   are  now   ready   to   compute   PG      ..      The 

•'■ » J 

signal-to-noise  ratio  of  the  input  signal  is  given  by 

SNR  =  -A^  (A.  38) 

2a 

2  2 

where   a      is   the   average  power    (variance)    of   the  noise   and  A  /2   is    the 

Rl-IS   signal  power.      From  4.37   it   is    obvious    that 

2 

AN        N 

SNR     =  -^  =   2   ^^\  ^^-^^^ 

4a 

therefore  PG    =  gNi;  =  2  (^-^O) 

which  is  a  realistic  result  since  the  original  noise  power  has  been  ef- 
fectively separated  into  N/2  bins  (see  symmetry  conditions  of  the  DFT 

in  section  IIB)  and  the  average  siganl  povzer  is  totally'  contained  within 

•  th  ,  . 
the  J    bin. 


D.   PROCESSING  GAIN  OF  THE  INTEGRATOR  AND  TOTAL  SYSTEM  GAIN 

The  processing  gam  due  to  incoherent  integration  m  the  j    bin  is 
due  entirely  to  the  variance  reduction  (or  smoothing)  of  the  noise.   This 
may  be  seen  as  follows: 

SNR  =  SNR  =  -^^  (4.41) 

and        SNR.  =  -Ar^  (4.42) 

where  a   and  a   are  the  variances  in  the  j    bin  due  to  the  noise  at 
stages  three  and  four,  respectively,  of  figure  4.2.   Referring  to  Equation 
4. 3  we  see  that 

SNR,    a^ 

PG    = =   -4r  (4.43) 

I,j    SNR„     2 

4 
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The  variance  of  the  smoothed  spectral  estimate  was  computed  by 
Nuttall  (1971)  and  is  given  by 


2     (P-1)       I  ,         2 
Var[G]  =  -^^^  I        (1  -  -J^)|(j)  (iS)|  (A. 44) 

^   i=-(P-l)    ^    ^ 


P   \i*w^°^l'  "^  \^    ^^  -   l^l^w^^^^l  ^       ^"^-"^^^ 


.0!(f)/,wn.,2    '-' 


where  G(f)  =  true  spectrum  of  noise 

(|)  (t)  =  autocorrelation  function  of  the  data  window 
w 

P  =  number  of  raw  transforms  averaged  to  obtain 
the  estimate 

S  =  amount  of  overlap  used  in  the  P  transforms  in 
seconds . 

The  reader  is  referred  to  figure  2.17  and  section  III  F  for  a  review  of 

how  the  above  parameters  relate. 

Now,  for  the  raw  spectral  estimate  (i.e.,  estimate  available  at  stage 

3  of  figure  4.2)  P  =  1  and  we  have  no  overlap.   Therefore 

o^  =   G^(f)|4  (0)|  (4.46) 

J  w 

For  the  smooth  spectral  estimate  (i.e.,  estimate  at  the  output  of 

the  integrator) ,  P>>>1  and  we  have 


o/  =-^^(lKO)|^  +  2  j  (1  -  f)U^(iS)l'^        (4.47) 

Therefore    PC   .  =  (4.48) 

'^  P      .   <J)„(iS)  2 

1  +  25;   (1-  -) 
i=l 


w 


4(0) 
w 


K 
2 


(4.49) 
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where  K  is  the  equivalent  number  of  degrees  of  freedom  of  the  estimate 

(see  section  3).   Note  that  if  P  =  T/L,  there  is  no  overlap  and  PG^  .=  P. 

I,  J 

For  P  >  T/L,  the  processing  gain  will  be  greater  than  P.   Combining 
equation  4.49  and  equation  4.40,  the  system  processing  gain  is  given  by 

NK 
PG  =  ^  (4.50) 

4 

To  maximize  the  system  processing  gain  for  a  given  T/L,  we  want  to 

overlap  the  transforms  so  as  to  maximize  the  value  of  K.   Figure  4.3 

shows  that  K    is  linear  as  a  function  of  T/L  if  the  Banning  window 
max 

is  used  on  the  data.   Similar  results  could  be  obtained  for  any  data 

window  using  equation  3.35.   Figure  4.4  shows  the  amount  of  overlap 

required  to  obtain  K    ,  .99K    and  .95K    as  a  function  of  T/L. 

max      max         max 

Note  that  whi].e  the  amount  of  overlap  required  to  obtain  K    increases 

max 

slightly  with  T/L,  the  overlap  required  to  obtain  .99K    or  .95K 
is  nearly  independent  of  T/L.   The  slight  discontinuities  in  figure  4.4 
at  small  values  of  T/L  are  due  to  the  large  percentage  overlap  changes 
with  the  addition  of  one  piece  (or  transform).   Thus,  for  the  Hanning 
window  it  would  seem  that  60  %  overlap  would  be  the  optimum  overlap 
from  a  processing  gain  point  of  view.   Figure  4.5  shows  how  the  value  of 
K  varies  as  a  function  of  T/L  if  other  than  optimum  overlap  is  used. 
From  figure  4.3  v/e  see  that 

K    =  4.15  (v)  -  2.20  (4.51) 

max         L 

So 

PG  =  N(1.0375  T/L  -  .55)  (4.52) 


or  in  decibels. 


PG  -  10  log  N  +  10  log^Q(1.0375  T/L  -  .55)  db    (4.53) 
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Figure   4.3.    K^  ^vs   T/L  for  Hanning  window 
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Figure  4.4.   Percent  overlap  vs  T/L  for  Hanning  window 
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Figure  4.5.  EDF  vs  percent  overlap  for  Banning  window 


Usually  one  would  expect  N>>T/L,  so  that  the  first  term  of  equation 
will  be  the  most  dominant  factor  in  figuring  the  processing  gain.   For 
example,  if  N  =  8192  and  T/L  =16, 

PG  =  39.13  +  12.05  =  51.18  db  (4.54) 

E.   NON-IDEAL  CONDITIONS 

In  the  above  discussion  of  processing  gain,  it  was  assumed  that  the 
signal  frequency  remained  constant  over  the  entire  period  of  observation, 
If  this  is  not  the  case,  then  both  PG  and  PG  will  be  less  than  ideal. 
For  example,  in  the  case  of  the  transform,  assume  that  the  input  fre- 
quency is  f  =  j /L  for  the  first  N/2  samples  and  then  abruptly  jumps  to 
f  =  (j+l)/L  for  the  remaining  N/2  samples.   Then,  the  processing  gain 
(PG  )  for  both  the  j    and  (j+1)    filters  is  N/8,  or  a  loss  of  6  db 
over  the  ideal  case. 

Next  assume  that  the  frequency  is  a  constant  f  =  j /L  for  t  seconds 
of  the  total  T  seconds  of  integration,  and  then  abruptly  jumps  to  f  = 
(j+l)/L  for  the  remaining  T-t  seconds.   Then  the  average  power  in   the 
j   bin  will  be  attenuated  by  a  factor  of  t/T  and  thus  PG  for  the  j 
bin  will  be  attenuated  by  a  factor  of  t/T  (i.e.,  a  loss  of  10  log  „ (t/T) 

St 

db)  from  the  ideal  case.   The  loss  from  the  ideal  case  for  the  (j+1) 

bin  will  be  10  log  „  (T-t)/T  db .   Thus  we  see  that  if  the  integration 

time  is  significantly  larger  than  the  time  the  signal  spends  in  the  bin, 

then  we  can  quickly  eliminate  any  gain  we  might  have  obtained  through 

the  DFT.   For  example,  if  T  =  256  and  t  =  80,  the  loss  in  the  j    bin 

amounts  to  15  db  over  the  ideal  case.   Careful  condideration  should 

therefore  be  given  to  the  integration  time  and  frequency  resolution  to 

to  use  for  the  particular  problem  at  hand.   Very  fine  resolution  and  a 

long  integration  time  will  not  aid  in  detection  of  a  signal  which  is 

unstable  in  frequency  and  possibly  fading  in  and  out  besides. 
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V.   SIGNAL  DETECTION  AND  THE  HUMAN  OPERATOR 

A .  INTRODUCTION 

In  the  mid-1950's  the  theory  of  statistical  decision,  motivated  by 
problems  in  radar,  was  translated  into  a  theory  of  signal  detection. 
Although  the  theory  specified  an  ideal  process,  the  generality  of  the 
theory  suggested  that  it  might  also  be  relevant  to  the  detection  of  sig- 
nals by  human  observers.   The  relevant  theory  seemed  to  provide  a  frame- 
V7ork  for  a  realistic  description  of  the  behavior  of  the  human  observer 
in  a  variety  of  perceptual  tasks.   A  brief  overview  of  the  general  the-, 
ory  and  a  description  of  certain  elements  of  the  signal  detection  theory 
appropriate  to  the  human  observer  are  included  here  to  point  out  the 
flexibility  that  a  human  operator  can  provide  in  a  signal  detection 
system. 

Statistical  decision  theory  specifics  the  optimal  behavior  in  a 
situation  where  one  must  choose  between  two  alternative  statistical 
hypotheses  on  the  basis  of  an  observed  event  or  series  of  events.   In 
particular,  it  specifies  the  optimal  cutoff,  along  the  continuum  on 
which  the  observed  events  are  arranged,  as  a  function  of  (a)  the  a 
priori  probabilities  of  the  two  hypotheses,  (b)  the  values  and  costs 
associated  with  the  various  decision  outcomes,  and  (c)  the  amount  of 
overlap  of  the  distributions  that  constitute  the  hypotheses.   In  short, 
the  detection  problem  is  a  problem  in  statistical  decision;  it  requires 
testing  statistical  hypotheses. 

B.  FUNDAMENTAL  DETECTION  PROBLEM 

In  the  fundamental  detection  problem,  an  observation  is  made  of 
events  occurring  in  a  fixed  interval  of  time,  and  a  decision  is  made, 
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based  on  this  observation,  V7hether  the  interval  contained  only  the  back- 
ground interference  or  a  signal  as  well.   The  interference,  v;hich  is 
random^  is  referred  to  as  noise  and  is  denoted  by  N;  the  other  alterna- 
tive is  termed  signal  plus  noise  and  is  denoted  by  SN.   In  the  funda- 
mental problem,  only  these  tvo  alternatives  exist  —  noise  is  alvjays  pre- 
sent, whereas  the  signal  may  or  may  not  be  present  during  a  specified 
observation  interval.   Actually,  the  observer,  who  has  advance  knowledge 
of  the  ensemble  of  signals  to  be  presented,  says  either  "yes,  a  signal 
was  present"  or  "no,  a  signal  was  not  present"  following  each  observa- 
tion.  The  fundamental  problem  will  later  be  extended  to  include  a  third 
alternative  -  "I'm  not  sure,  continue",  but  for  now  consider  only  the 
fundamental  case. 

C.  REPRESENTATION  OF  SENSORY  INl^ORMATION 

Any  observation  may  arise,  with  specific  probabilities,  either  from 
noise  alone  or  from  signal  plus  noise.   This  is  shown  graphically  in 
figure  5.1.   The  observation  is  labeled  x  and  plotted  on  the  abscissa. 
The  left-hand  distribution,  labeled  f  (x) ,  represents  the  probability 
density  that  x  will  result  given  the  occurrence  of  noise  alone.   The 
right-hand  distribution,  f   (x) ,  is  the  probability  density  function  of 
X  given  the  occurrence  of  signal  plus  noise.   Since  the  observation  will 
tend  to  be  of  greater  magnitude  when  a  signal  is  presented,  the  mean  of 
the  SN  distribution  will  be  greater  than  the  mean  of  the  N  distribution. 
In  general,  the  greater  the  amplitude  of  the  signal,  the  greater  will 
be  the  separation  of  these  means. 

D.  OBSERVATION  AS  A  VALUE  OF  A  LIKELIHOOD  RATIO 

In  the  discussion  thus  far  it  has  been  assumed  that  the  observation 
may  be  represented  along  a  single  axis.   That  this  is  a  valid  assump- 
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Figure  5.1.  The  probability  density  function  of  noise 
and  signal  plus  noise. 
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tion  —  in  spite  of  the  fact  that  the  response  of  the  visual  system  pro- 
bably has  many  dimensions  -  is  explained  in  terms  of  some  concepts  that 
are  basic  to  the  theory. 

Assume  that  the  response  of  the  sensory  system  has  several  dimen- 
sions, and  represent  it  as  a  point  in  an  m-dimensional  space.   Call  this 
point  y.   For  each  such  point  in  this  space  there  is  some  probability 
density  that  it  resulted  from  noise  alone,  f  (y) ,  and  similarly  some 
probability  density  that  it  was  due  to  signal  plus  noise,  f   (y) . 

o  IN 

Therefore,  there  exists  a  likelihood  ratio  for  each  point  in  the  space, 
^ (y)  "  f cMCy)/^^^^^ '  expressing  the  likelihood  that  the  point  y  arose 
from  SN  relative  to  the  likelihood  that  it  arose  from  N,   Since  any 
point  in  the  space,  i.e.  any  sensory  datum,  may  be  thus  represented  as  a 
real,  non-zero  number,  these  points  may  be  considered  to  lie  along  a 
single  axis.   The  observation  x  is  thus  identified  with  A.(y)  and  the 
decision  axis  becomes  the  likelihood  ratio. 

The  assumption  of  a  unidimensional  decision  axis  is  independent  of 
the  characteristics  of  the  signal  and  noise.   Rather,  it  depends  on  the 
fact  that  just  two  decision  alternatives  are  considered.   More  generally, 
it  can  be  shown  that  the  number  of  dimensions  required  to  represent  the 
observation  is  M-1,  where  M  is  the  number  of  decision  alternatives  con- 
sidered by  the  observer. 

Having  established  that  the  observation  x  may  be  identified  with  the 
likelihood  ratio  A.  (y)  ,  it  can  also  be  shown  that  we  may  equally  well 
identify  x  with  any  monotonic  transformation  of  A- (y) ;  i.e.  nothing  is 
lost  by  distorting  the  linear  continuum  as  long  as  order  is  maintained. 
In  particular,  if  it  is  assumed  that  a  transformation  of  A.  (y)  results  in 
Gaussian  density  functions  on  x,  the  problem  will  be  greatly  simplified 
as  will  be  seen  shortly.   The  existence  of  such  a  transformation  was 
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assumed  in  the  representation  of  the  density  functions,  f   (x)  and 
f  (x)  in  figure  5.1.   A  further  assumption  is  that  the  SN  function  is 
simply  a  translation  of  the  N  function;  i.e.,  that  the  two  density 
functions  have  equal  variances.   This  has  been  found  to  be  not  strictly 
true,  but  for  the  purpose  of  this  discussion  the  small  difference  in  the 
variances  will  not  be  relevant. 

To  summarize  the  last  few  paragraphs,  it  has  been  shown  that  an 
observation  may  be  characterized  by  a  value  of  a  likelihood  ratio, 
A.  (y) ,  i.e.,  the  likelihood  that  the  response  of  the  sensory  system  y 
arose  from  SN  relative  to  the  likelihood  that  it  arose  from  N.   This 
permits  viewing  the  observations  as  lying  along  a  single  axis.   It  was 
then  assumed  that  a  particular  transformation  of  X (y)  would  result  in 
normal  probability  density  functions  on  x.   The  observer  is  assumed  to 
base  his  decisions  on  the  variable  x. 

E.   DEFINITION  OF  THE  CRITERION 

From  the  definition  of  the  problem  we  see  that  the  observer  attemp- 
ting to  detect  signals  in  noise  is  faced  with  two  alternate  hypotheses. 
He  must  decide  from  which  hypothesis  the  observation  resulted,  N  or  SN. 
The  observer  must  establish  a  policy  that  defines  the  circumstances 
under  which  the  observation  will  be  regarded  as  resulting  from  each  of 

the  two  possible  events.   He  will  establish  a  criterion,  a  cutoff  x  on 
^  '  c 

the  continuum  of  observations,  to  which  he  can  relate  any  given  observa- 
tion x.  .   If  he  finds  for  the  i   observation,  x. ,  that  x.  >  x  ,  he  says 
1  lie 

"SN";  of  x.  <  X  ,  he  says  "N" . 
1    c       ■' 

The  observer's  decision  will  be  one  of  four  outcomes:   he  will  say 
"SN"  or  "N"  and  will  in  either  case  be  right  or  wrong.   The  decision 
outcome,  in  other  words,  will  be  a  "hit",  a  "miss",  a  "correct  rejec- 
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tion",  or  a  "false  alarm".   Clearly,  these  four  possibilities  are  inter- 
dependent.  For  example,  an  increase  in  the  probability  of  a  hit  can  be 
achieved  only  by  accepting  an  increase  in  the  probability  of  a  false 
alarm  and  decrease  in  the  other  probabilities.   Thus,  the  balance  de- 
sired by  an  observer  in  any  instance  will  determine  the  optimal  location 

of  his  criterion,  x  .   Tlie  relationship  of  these  four  probabilities  to 

c 

one  another  is  shown  in  figure  5.2,  where  the  probability  of  a  "hit"  is 

represented  by  region  f„,,  •  A,  the  probability  of  a  "miss"  by  f„   •  B 

SN  SN 

(or  p) ,  the  probability  of  a  "correct  rejection"  by  f   •  B,  and  the  pro- 
bability of  a  "false  alarm"  by  f   'A  (or  o^) . 

■^   N 

Tliere  are  two  conditional  probabilities  of  principal  interest.   These 

are  the  conditional  probability  of  the  observer  saying  "SN"  given  the 

occurrence  of  a  signal,  P   (A) ,  and  the  probability  of  saying  "SN"  given 

the  occurrence  of  noise  alone,  P„(A).   These  two  are  sufficient  since  the 

N 

other    two   are   merely   their    complements:       P„„(B)    =    1    -    P„  ,(A)    and    P„(B)    = 

SN  SN         N 

1  -  P  (A) .   The  conditional  and  joint  probabilities  are  related  as 
follows: 


p   (A)  =  P^SN  '  A) 

^SN^^''     p(SN)  ^^'^^ 


V^^=-^%f^  (5-2) 


where  p(SN)  is  the  a  priori  probability  of  signal  occurrence  and  p(N)  = 
1  -  p(SN)  is  the  a  priori  probability  of  occurrence  of  noise  alone. 
With  reference  to  figure  5.2,  note  that  "P^^i^)    is  the  integral  of  f<.>^(x) 
over  the  critical  region  A  and  P^(A)  is  the  integral  of  f^j(x)  over  A. 

In  statistical  decision  theory,  and  in  the  theory  of  signal  detecta- 
bility,  the  optimal  criterion  for  a  decision  is  specified  in  terms  of  the 
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Figure  5.2.  The  relationship  between  the  four  possible 
outcomes  of  a  simple  detection'  decision. 
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likelihood  ratio.   That  is  to  say,  it  can  be  shown  that  the  optimal  cri- 
terion can  always  be  specified  by  some  value  G  of  X (x) .   In  other  words, 
the  critical  region  A  that  corresponds  to  the  criterion  contains  all 
observations  with  likelihood  ratio  greater  than  or  equal  to  9,  and  none 
of  those  with  likelihood  ratio  less  than  S .   To  see  that  this  is  true, 
note  that  the  expected  value  of  a  decision  (denoted  EV)  is  defined  in 
in  statistical  decision  theory  as  the  sum,  over  the  potential  outcomes 
of  a  decision,  of  the  products  of  the  probability  of  outcome  and  desira- 
bility of  outcome.   Thus,  using  the  notation  V  for  positive  individual 
values  of  the  observer  and  K  for  costs  on  negative  individual  values, 
we  have  the  following  equation: 

^^  =  ^SN-A  P^^^*^^  -^  \.B  ^^^*^^  -  ^SN'B  P^^^*^^  -  ^.A  ^^^'^^     ^^'^^ 

If  a  priori  and  conditional  probabilities  from  equations  5.1  and  5.2 
are  substituted  for  the  joint  probabilites  in  equation  5.3,  then  collec- 
ting terms  yields  the  results  that  maximizing  EV  is  equivalent  to  maxi- 
mizing 


where 


It  can  be  shown  that  this  value  of  9  is  equal  to  the  value  of  the  like- 
lihood ratio  that  corresponds  to  the  optimal  criterion.   From  equation 
5.4  it  is  seen  that  the  value  9  simply  weights  the  hits  and  false  alarms 
and  equation  5.5   shows  that  9  is  determined  by  the  a  priori  probabili- 
ties of  occurrence  of  signal  and  of  noise  alone  and  by  the  values  asso- 
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ciated  with  the  individual  decision  outcomes.   Referring  to  figure 
5.2  and  keeping  in  mind  the  result  that  9  is  a  critical  value  of  ^ (x) , 
it  should  be  clear  that  the  optimal  cutoff  x  along  the  x  axis  is  at  the 
point  on  the  axis  where  the  ratio  of  the  ordinate  value  of  f   (x)  to  the 

ordinate  value  of  f„(x)  is  a  certain  number,  n-mely  9. 

N 

The  value  of  9  above  specifies  the  optimal  weighting  of  hits  rela- 
tive to  false  alarms;  x  should  always  be  located  at  the  point  on  the  x- 

c 

axis  corresponding  to  9.   Just  where  this  value  of  x  v^ill  be  with  re- 
ference to  the  x-axis  depends  not  only  upon  the  overlap  of  the  two  den- 
sity functions  (i.e.,  upon  the  signal  strength). 

F.   RECEIVER  OPERATING  CHARACTERISTICS 

Whatever  criterion  the  observer  actually  uses,  even  if  it  is  not  an 
optimal  criterion,  it  can  be  described  by  a  single  number  —  some  value 
of  likelihood  ratio.   By  observing  how  the  observer's  performance  is 
evaluated  with  respect  to  the  location  of  his  criterion,  it  can  be  seen 
how  his  performance  may  be  evaluated  with  respect  to  his  sensory  capa- 
bilities . 

As  previously  noted,  the  fundamental  quantities  in  the  evaluation  of 
the  operators  performance  are  P^(A)  and  p   (A) .   Figure  5.3  shows  a 
graph  of  p   (A)  versus  p  (A) ,  each  arc  representing  a  different  amount 
of  overlap.   Each  curve  in  figure  5.3  corresponds  to  a  given  separation 

between  the  means  of  the  density  functions  f ,(x)  and  f„_(x).   The  para- 

.N         SN 

meter  of  these  curves  is  labeled  d',  where  d'  is  defined  as  the  differ- 
ence between  the  means  of  the  two  density  functions  expressed  in  terms 
of  their  standard  deviation,  that  is: 

y    (x)  -  y   (x) 
d-  .  ^N N_  (5.6) 


°f,(x) 
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If  the  decision  criterion  is  set  at  the  left  of  figure  5.2,  we  will 
obtain  a  point  in  the  upper  right-hand  corner  of  figure  5.3;  both  p   (A) 
and  P„(A)  are  unity.   If  the  criterion  is  set  at  the  right  end  of  the 
decision  axis  in  figure  5.2,  a  point  at  the  other  extreme  of  figure  5.3 
results;  both  p   (A)  and  P,,(A)  are  equal  to  zero.  Between  these  two 
extremes  lies  the  criterion  of  practical  interest.   The  form  of  the 
curves  in  figure  5.3  is  not  the  only  one  which  might  result,  but  it  is 
the  form  v/hich  will  result  if  the  observer  chooses  a  criterion  in  terms 
of  likelihood  ratio,  and  the  probability  density  functions  are  normal 
and  of  equal  variance. 

In  the  context  of  the  detection  problem,  figure  5.3  is  usually  re- 
ferred to  as  the  receiver-operating-characteristic,  or  ROC  curve.   The 
optimal  "operating  level"  is  the  point  on  a  curve  where  its  slope  is 
equal  to  6  (see  equation  5.5).   That  is,  the  expression  p   (A)  -  6p  (A) 
defines  a  utility  line  of  slope   ,  and  the  point  of  tangency  of  this 
line   to   the  ROC  curve  is  the  optimal  operating  level.   Thus  the  theory 
specifies  the  appropriate  "hit"  probability  and  "false  alarm"  probability 
for  any  definition  of  optimum  and  any  set  of  parameters  characterizing 
the  detection  situation. 

Since  the  separation  between  the  means  of  the  two  density  functions 
is  a  function  of  signal  amplitude,  d'  is  an  index  of  the  detectability 
of  a  given  signal  for  a  given  observer.   The  variable  d'  is  a  measure 
of  the  observer's  sensory  capabilities  or  the  effective  signal  strength. 
The  criterion   that  is  employed  by  the  observer,  which  determines  the 
p  (A)  and  Pcf.(A)  for  some  fixed  d',  reflects  the  effect  of  variables 
which  have  been  variously  called  the  set,  attitude,  or  motives  of  the 
observer. 
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Figure  5.3.   Typical  ROC  curves 
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G.   SEQUENTIAL  OBSERVATIONS 

If  additional  observations  are  allowed  before  a  decision  is  required, 
then  the  observer  is  assumed  to  adopt  two  criteria,  or  three  critical 
regions.   These  regions,  at  least  those  pertaining  to  the  first  observa- 
tion, are  represented  as  A,  B,  and  C  in  figure  5.4.   If  the  first  obser- 
vation falls  in  A  the  observer  says  "SN",  if  it  falls  in  B  he  says  "N", 
and  if  it  falls  in  C  he  says  "Continue".   The  theory  of  sequential  analy- 
sis assumes  that  the  successive  observations  are  integrated  and  that  the 
integrated  value  is  compared  with  the  three  critical  regions  at  each 
observation  stage.  Thus,  a  decision  is  made  whenever  the  combined  evidence 
for  one  or  the  other  of  the  two  hypotheses  is  sufficiently  persuasive. 
It  is  clear  that  the  relationship  depicted  in  figure  5.4  holds  only  for 
the  first  observation  in  a  sequence.   A  second  observation  is  required 
only  if  the  first  falls  in  the  intermediate  region,  C.   The  combination 
of  the  first  and  second  observations,  therefore,  will  not  be  normally 
distributed  ,  for  it  results  from  the  addition  of  a  normal  variable  and 
a  normal  variable  with  both  tails  missing.   Analysis  of  the  inter-rela- 
tionships of  the  parameters  is  extremely  difficult  in  the  sequential 
observation  problem  because  of  this. 

H.   USE  OF  IDEAL  DESCRIPTIONS  AS  MODELS 

At  first  glance  it  might  appear  that  a  theory  of  ideal  behavior  is 
really  not  very  useful  when  talking  about  a  human  observer.   This  is  not 
the  case,  hov/ever;  ideal  behavior  constitutes  a  convenient  base  from 
which  to  explore  the  complex  behavior  of  a  real  organism.   Ideal  con- 
ditions generally  involve  few  variables.   Having  identified  the  per- 
formance to  be  expected  under  ideal  conditions,  it  is  possible  to  extend 
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Figure  5.4.  Representation  of  the  sequential  decision 
problem. 
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the  model  to  include  additional  variables  associated  with  the  real 
world.   In  studies  using  human  observers  in  the  signal  detection  pro- 
blem, it  has  been  found  that  the  ideal  theory  closely  approximates 
human  behavior.   The  problem  is  one  of  altering  the  ideal  model  in 
some  way  so  that  it  is  slightly  less  than  ideal. 

The  human  observer  is  able  to  adjust  his  detection  criterion  to  a 
particular  problem  if  the  information  is  presented  to  him  in  the  correct 
manner.   That  is,  the  learning  capabilities  of  the  human  observer  can 
provide  a  great  deal  of  flexibility  to  a  signal  detection  system  if  he 
is  used  correctly.   The  value  of  this  flexibility  will  be  beneficial 
however,  only  if  the  values  of  the  observer  are  such  that  they  complement 
the  system.   The  attitude  and  motives  of  the  observer  play  a  very  impor- 
tant role  in  the  effectiveness  of  the  system.   Recent  advances  in  digital 
processing  techniques  and  computer  graphics  show  great  promise  in  con- 
trolling these  parameters  through  increased  processor  gain  and  a  man- 
machine  interface  never  before  possible  with  analog  equipment. 
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VI.  CONCLUSIONS  AND  RECOMMENDATIONS 

This  paper  has  discussed  digital  signal  processing  and  spectral  an- 
alysis techniques  as  applied  to  the  detection  of  discrete  narrowband  sign- 
als in  a  nonstationary  noise  background.   Specifically,  the  discussion 
was  slanted  toward  application  of  a  Fourier  processor  with  integration 
to  the  Navy's  ASW  effort.   There  is  a  real  need  today  for  improved  proces- 
sing techniques  and  it  is  apparent  that  digital  processors  are  the  most 
likely  means  to  improvement.   The  fast  Fourier  transform  algorithm  has 
led  to  extensive  revision  in  the  methods  of  frequency  domain  analysis 
and  real-time  digital  processors  to  the  point  that  bandwidths  that  are 
sufficient  for  most  problems  are  a  reality  today. 

It  appears  that  the  first  attempt  at  using  interactive  graphics  tec- 
hniques in  the  display  of  a  passive  ASW  signal  processor  is  being  made  in 
the  "Spotlight"  processor.   Preliminary  use  of  the  display  indicates  that 
the  operator  now  has  a  flexibility  and  an  insight  into  the  detection 
problem  never  before  possible  with  previous  displays.   This  particular 
combination  of  processing  and  display  has  tremendous  possibilities  in  the 
field  of  ASW  today. 

There  are  two  questions  fundamental  to  the  ASW  detection  problem, 
which  the  author  feels  are  of  primary  importance  in  determining  the  per- 
formance capabilities  of  a  Fourier  signal  processor  such  as  the  "Spot- 
light" processor.   The  first  is  the  question  of  the  effects  on  the 
spectral  estimates  of  integrating  for  relatively  long  periods  of  time 
over  information  which  is  known  to  be  from  a  non-stationary  background. 
The  second  is  the  human  factors  question  of  what  is  the  optimal  amount 
of  redundancy  required  in  the  display  for  a  processor  such  as  this. 
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Answers,  or  at  least  partial  answers,  to  these  questions  v;ould  aid 
tremendously  in  the  actual  implementation  of  this  type  of  equipment. 
Research  in  these  areas  is  highly  recommended  since  it  is  only  after 
these  questions  are  answered  that  the  full  potential  of  such  a  system 
can  be  evaluated. 
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